AMINO POLYOL AMINE OXIDASE POLYNUCLEOTIDES AND RELATED
POLYPEPTIDES AND METHODS OF USE 

Technical Field 

The present invention relates generally to the detection and isolation of fumonisin
and AP1 degrading enzymes and to compositions and methods for degradation of
fumonisin, a structurally related mycotoxin, or its hydrolysis product AP1. This method
has broad application in agricultural biotechnology and crop agriculture and in the
improvement of food grain quality.

Cross Reference To Related Application 

This application is a continuation-in-part of U.S. Application No. 09/352,159,
which claims the benefit of U.S. Provisional Application No. 60/135,391, filed May 21,
1999, and U.S. Provisional Application No. 60/092,936, filed July 15, 1998, all of which are
hereby incorporated by reference. This application also claims the benefit of U.S.
Application No. 09/352,168, which is hereby incorporated by reference.

Background of the Invention 

Fungal diseases are common problems in crop agriculture. Many strides have been
made against plant diseases as exemplified by the use of hybrid plants, pesticides and
improved agricultural practices. However, as any grower or home gardener can attest, the
problems of fungal plant disease continue to cause difficulties in plant cultivation. Thus,
there is a continuing need for new methods and materials for solving the problems caused
by fungal diseases of plants.

These problems can be met through a variety of approaches. For example, the
infectious organisms can be controlled through the use of agents that are selectively
biocidal for the pathogens. Another method is interference with the mechanism by which
the pathogen invades the host crop plant. Yet another method, in the case of pathogens
that cause crop losses, is interference with the mechanism by which the pathogen causes
injury to the host crop plant. Still another method, in the case of pathogens that produce
toxins that are undesirable to mammals or other animals that feed on the crop plants, is
interference with toxin production, storage, or activity. This invention falls into the latter
two categories.

Since their discovery and structural elucidation in 1988 (Bezuidenhout et al.,
Journal Chem Soc, Chem Commun 1988: 743-745 (1988)), fumonisins have been
recognized as a potentially serious problem in maize-fed livestock. They are linked to
several animal toxicoses including leukoencephalomalacia (Marasas, et al., Onderstepoort
Journal of Veterinary Research 55: 197-204 (1988); Wilson, et al., American Association
of Veterinary Laboratory Diagnosticians: Abstracts 33rd Annual Meeting, Denver,
Colorado, October 7-9, 1990, Madison, Wisconsin, USA) and porcine pulmonary edema
(Colvin, et al., Mycopathologia 117: 79-82 (1992)). Fumonisins are also suspected
carcinogens (Geary W (1971) Coord Chem Rev 7: 81; Gelderblom, et al., Carcinogenesis
12: 1247-1251 (1991); Gelderblom, et al., Carcinogenesis 13: 433-437 (1992)). Fusarium
isolates in section Liseola produce fumonisins in culture at levels from 2 to >4000 ppm
(Leslie, et al., Phytopathology 82: 341-345 (1992)). Isolates from maize (predominantly
mating population A) are among the highest producers of fumonisin (Leslie et al., supra).
Fumonisin levels detected in field-grown maize have fluctuated widely depending on
location and growing season, but both preharvest and postharvest surveys of field maize
have indicated that the potential for high levels of fumonisins exists (Murphy, et al., J Agr
Food Chem 41: 263-266 (1993)). Surveys of food and feed products have also detected
fumonisin (Holcomb, et al., J Agr Food Chem 41: 764-767 (1993); Hopmans, et al., J Agr
Food Chem 41: 1655-1658 (1993); Sydenham, et al., J Agr Food Chem 39: 2014-2018
(1991)). The etiology of Fusarium ear mold is poorly understood, although physical
damage to the ear and certain environmental conditions can contribute to its occurrence
(Nelson, Mycopathologia 117: 29-36 (1992)). Fusarium can be isolated from most field
grown maize, even when no visible mold is present. The relationship between seedling
infection and stalk and ear diseases caused by Fusarium is not clear. Genetic resistance to
visible kernel mold has been identified (Gendloff, et al., Phytopathology 76: 684-688
(1986); Holley, et al., Plant Dis 73: 578-580 (1989)), but the relationship of visible mold
to fumonisin production has yet to be elucidated.

Fumonisins have been shown in in vitro mammalian cell studies to inhibit
sphingolipid biosynthesis through inhibition of the enzyme sphingosine N-acetyl
transferase, resulting in the accumulation of the precursor sphinganine (Norred, et al.,
Mycopathologia 117: 73-78 (1992); Wang, et al., Biol Chem 266: 14486 (1991); Yoo, et
al., Toxicol Appl Pharmacol 114: 9-15 (1992); Nelson, et al., Annu Rev Phytopathol
31:233-252 (1993)). It is likely that inhibition of this pathway accounts for at least some
of fumonisin's toxicity, and support for this comes from measures of sphinganine:
sphingosine ratios in animals fed purified fumonisin (Wang, et al., J Nutr 122: 1706-1716
(1992)). Fumonisins also affect plant cell growth (Abbas, et al., Weed Technol 6: 548-552
(1992); Vanasch, et al., Phytopathology 82: 1330-1332 (1992); Vesonder, et al., Arch
Environ Contam Toxicol 23: 464-467 (1992)). Kuti et al. (Abstract, Annual Meeting
American Phytopathological Society, Memphis, TN: APS Press 1993) reported on the
ability of exogenously added fumonisins to accelerate disease development and increase
sporulation of Fusarium moniliforme and Fusarium oxysporum on tomato.

Enzymes that degrade the fungal toxin fumonisin to its de-esterified form (e.g. AP1
from FB1) have been identified in US patent no. 5,716,820, issued February 10, 1998; US
patent no. 5,792,931, issued August 11, 1998; US patent no. 6,025,188, issued February
15, 2000; and pending US application no. 08/888,950, filed July 7, 1997, all hereby
incorporated by reference. It is understood that AP1 as used here designates the
hydrolyzed form of any fumonisin, FB1, FB2, FB3, FB4, or any other AP1-like
compounds, including synthetically produced AP1-like compounds, that contain a C-2 or
C-1 amine group and one or more adjacent hydroxyl groups. Plants expressing a
fumonisin esterase enzyme, infected by fumonisin producing fungus, and tested for
fumonisin and AP1 were found to have low levels of fumonisin but high levels of AP1.
AP1 is less toxic than fumonisin to plants and probably also to animals but contamination
with AP1 is still a concern (Lamprecht, et al., Phytopathology, 84:383-391 (1991)). The
preferred result would be complete detoxification of fumonisin to a non-toxic form.
Therefore enzymes capable of degrading AP1 are necessary for the further detoxification
of fumonisin.

The present invention provides newly discovered polynucleotides and related
polypeptides of amino polyol amine oxidase (abbreviated APAO, formerly known as AP1
catabolase, US patent no. 5,716,820, supra; US patent no. 5,792,931, supra; US patent no.
6,025,188, supra; pending US application no. 08/888,950, supra; trAPAO is the
abbreviation for a truncated, but still functional APAO), capable of oxidatively
deaminating the AP1 to a compound identified as the 2-oxo derivative of AP1 or its cyclic
ketal form (abbreviated as 2-OP, formerly called AP1-N1, US patent no. 5,716,820, US
patent no. 5,792,931, US patent no. 6,025,188, supra; pending US application no.
08/888,950, supra), isolated from Exophiala spinifera, ATCC 74269. The partially
purified APAO enzyme from Exophiala spinifera has little or no activity on intact FB1, a
form of fumonisin. However, recombinant APAO enzyme from Exophiala spinifera,
expressed in E. coli, has significant but reduced activity on intact FB1 and other B-series
fumonisins. APAO or trAPAO thus could potentially be used without fumonisin esterase
since the amine group is the major target for detoxification. Alternatively, fumonisin
esterase and APAO (or trAPAO) can be used together for degrading toxins.

APAO is a type of flavin amine oxidase (EC 1.4.3.4, enzyme class nomenclature, see
Enzyme Nomenclature 1992, Recommendations of the Nomenclature Committee of the
IUBMB on the Nomenclature and Classification of Enzymes, Academic Press, Inc.
(1992)). One class of flavin amine oxidases in mammals is known as monoamine
oxidases, which participate in the conversion of amines involved in neuronal function.
A prokaryotic flavin amine oxidase that deaminates putrescine has been described
(Ishizuka et al., J. Gen Microbiol 139:425-432 (1993)). A single fungal gene, from
Aspergillus niger, has been cloned (Schilling et al., Mol Gen Genet 247:430-438 (1995)).
Its product deaminates a variety of alkyl and aryl amines, but when tested for its ability to
oxidize AP1, was found not to contain AP1 oxidizing activity.

The toxicity of fumonisins and their potential widespread occurrence in food and
feed make it imperative to find detoxification or elimination strategies to remove the
compound from the food chain.

Summary of the Invention 

The present invention provides polynucleotides and related polypeptides of newly
discovered APAOs. SEQ ID NO: 5 contains the nucleotide sequence of an active,
truncated APAO (trAPAO), SEQ ID NO: 10 contains the nucleotide sequence of trAPAO
with an additional lysine and SEQ ID NOS: 22, 35, 37, 39, 41, 43, and 45 comprise full
length nucleotide sequences of APAOs isolated from different organisms. In addition,
APAO can be modified to eliminate glycosylation sites and/or cysteine residues, for
example, see SEQ ID NOS: 32, 48, 50, and 52. Another aspect of the present invention is
the method of predicting possible mutagenesis sites on APAO by developing a
3-dimensional model of APAO and then identifying the possible sites that may contribute to
misfolding of the protein. The present invention also includes the 3-dimensional model of
APAO generated by a computer modeling program, preferably the Modeler program. For
expression in a plant, the polynucleotide of the present invention can be operably linked to
a targeting sequence. It is an object of the present invention to provide transgenic plants
comprising the nucleic acids of the present invention.

Therefore, in one aspect, the present invention relates to an isolated APAO
encoding polynucleotide ligated to a fumonisin esterase encoding polynucleotide wherein
the APAO encoding polynucleotide comprises a member selected from (a) a
polynucleotide encoding a polypeptide of the present invention; (b) a polynucleotide
having at least 70% sequence identity to the polynucleotides of the present invention; and
(c) a polynucleotide of the present invention. The isolated nucleic acid can be DNA. The
isolated nucleic acid can also be RNA. Examples of fumonisin esterase genes include, but
are not limited to, ESP1 and BEST1.

In another aspect, the present invention relates to vectors comprising the
polynucleotides of the present invention, including ligated and non-ligated
polynucleotides. The present invention also relates to recombinant expression cassettes
comprising a polynucleotide of the present invention operably linked to a promoter.

In another aspect, the present invention is directed to a host cell into which has
been introduced the polynucleotides of the present invention, including a host cell
comprising a fumonisin esterase ligated to an APAO or a fumonisin esterase not ligated to
an APAO.

In yet another aspect, the present invention relates to a transgenic plant or plant cell
comprising a recombinant expression cassette with a promoter operably linked to any of
the isolated polynucleotides of the present invention. Preferred plants containing the
recombinant expression cassette of the present invention include but are not limited to
maize, soybean, sunflower, sorghum, canola, wheat, alfalfa, cotton, rice, barley, tomato,
and millet. The present invention also provides transgenic seed from the transgenic plant.

In another aspect, the present invention relates to an isolated protein selected from
the group consisting of (a) a polypeptide comprising at least 70% sequence identity to a
polypeptide of the present invention; (b) a polypeptide encoded by a nucleic acid of the
present invention; and (c) a polypeptide characterized by a polypeptide of the present
invention.

This invention further provides methods of degrading fumonisin, a structurally
related mycotoxin, a fumonisin breakdown product, or a structurally related mycotoxin
breakdown product, by applying APAO as a spray or wash. Additionally, fumonisins and
related mycotoxins can be degraded by the application of both fumonisin esterase enzymes
and APAO enzymes. Mycotoxins can be degraded in harvested grain, during the
processing of harvested grain, in animal feed, or in plant tissue as, for example, during the
use of the plant for silage or as a spray on grain, fruit or vegetables. Further, this invention
provides methods of degrading fumonisin, a structurally related mycotoxin, a fumonisin
breakdown product, or a structurally related mycotoxin breakdown product, by
transforming the APAO polynucleotide, alone or in combination with polynucleotides
encoding a fumonisin esterase, into plant cells.

The polynucleotides of the present invention can also be used as a selectable
marker for plant transformation. By transforming plant cells with an expression cassette
comprising a polynucleotide of the present invention and then placing the plant cells on
media containing FB1, AP1 or a phytotoxic analog, only the plant cells expressing the
polynucleotide of the present invention would survive.

Another embodiment of the present invention is the use of the enzyme fumonisin 
esterase and APAO by themselves or in combination as reagents for detecting fumonisin 
and structurally related toxins. 

Brief Description of the Drawings 

Figure 1 shows a 3-dimensional model of APAO (1B) based on the crystal structure
of a related amine oxidase from maize, maize polyamine oxidase (MPAO) (1A). The sites
for possible mutation of APAO to alter glycosylation sites or cysteine residues are shown.

Figure 2 shows a 3-dimensional model of APAO (2B) based on the crystal structure
of a related amine oxidase from maize, MPAO (2A). The substrate binding holes are
shown as circles.

Detailed Description of the Invention

Unless defined otherwise, all technical and scientific terms used herein have the
same meaning as commonly understood by one of ordinary skill in the art to which this
invention belongs. Unless mentioned otherwise, the techniques employed or contemplated
herein are standard methodologies well known to one of ordinary skill in the art. The
materials, methods and examples are illustrative only and not limiting. The following is
presented by way of illustration and is not intended to limit the scope of the invention.

The practice of the present invention will employ, unless otherwise indicated,
conventional techniques of botany, microbiology, tissue culture, molecular biology,
chemistry, biochemistry and recombinant DNA technology, which are within the skill of
the art. Such techniques are explained fully in the literature. See, e.g., J. H. Langenheim
and K. V. Thimann, Botany: Plant Biology and Its Relation to Human Affairs (1982) John
Wiley; Cell Culture and Somatic Cell Genetics of Plants, Vol. 1 (I. K. Vasil, ed. 1984); R.
V. Stanier, J. L. Ingraham, M. L. Wheelis, and P. R. Painter, The Microbial World, (1986)
5th Ed., Prentice-Hall; O. D. Dhingra and J. B. Sinclair, Basic Plant Pathology Methods,
(1985) CRC Press; Maniatis, Fritsch & Sambrook, Molecular Cloning: A Laboratory
Manual (1982); DNA Cloning, Vols. I and II (D. N. Glover ed. 1985); Oligonucleotide
Synthesis (M. J. Gait ed. 1984); Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins
eds. 1984); and the series Methods in Enzymology (S. Colowick and N. Kaplan, eds.,
Academic Press, Inc.).

Units, prefixes, and symbols may be denoted in their SI accepted form. Unless
otherwise indicated, nucleic acids are written left to right in 5' to 3' orientation; amino acid
sequences are written left to right in amino to carboxy orientation, respectively. Numeric
ranges are inclusive of the numbers defining the range. Amino acids may be referred to
herein by either their commonly known three letter symbols or by the one-letter symbols
recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Nucleotides,
likewise, may be referred to by their commonly accepted single-letter codes. The terms
defined below are more fully defined by reference to the specification as a whole.

In describing the present invention, the following terms will be employed, and are
intended to be defined as indicated below.

By "microbe" is meant any microorganism (including both eukaryotic and
prokaryotic microorganisms), such as fungi, yeast, bacteria, actinomycetes, algae and
protozoa, as well as other unicellular structures.

A "fumonisin-producing microbe" is any microbe capable of producing the
mycotoxin fumonisin or analogs thereof. Such microbes are generally members of the
fungal genus Fusarium, as well as recombinantly derived organisms, which have been
genetically altered to enable them to produce fumonisin or analogs thereof.

By "degrading fumonisin" is meant any modification to fumonisin, AP1, or any
derivative of fumonisin or AP1 which causes a decrease or loss in its toxic activity, such as
degradation to less than 1%, 5%, 10%, or 50% of original toxicity, with less than 10%
being preferred. Such a change can comprise cleavage of any of the various bonds,
oxidation, reduction, the addition or deletion of a chemical moiety, or any other change
that affects the activity of the molecule. In a preferred embodiment, the modification
includes hydrolysis of the ester linkage in the molecule as a first step and then oxidative
deamination. Furthermore, chemically altered fumonisin can be isolated from cultures of
microbes that produce an enzyme of this invention, such as by growing the organisms on
media containing radioactively-labeled fumonisin, tracing the label, and isolating the
degraded toxin for further study. The degraded fumonisin can be compared to the active
compound for its phytotoxicity or mammalian toxicity in known sensitive species, such as
porcines, rabbits, and equines, or in cell or tissue culture assays. Such toxicity assays are
known in the art. For example, in plants a whole leaf bioassay can be used in which
solutions of the active and inactive compound are applied to the leaves of sensitive plants.
The leaves may be treated in situ or, alternatively, excised leaves may be used. The
relative toxicity of the compounds can be estimated by grading the ensuing damage to the
plant tissues and by measuring the size of lesions formed within a given time period.
Other known assays can be performed at the cellular level, employing standard tissue
culture methodologies, e.g., using cell suspension cultures.

By "fumonisin esterase" is meant any enzyme capable of hydrolysis of the ester
linkage in fumonisin or a structurally similar molecule such as AAL toxin. Two examples
of such enzymes are ESP1 and BEST1 found in US patent no. 5,716,820, issued February
10, 1998; US patent no. 5,792,931, issued August 11, 1998; US patent no. 6,025,188,
issued February 15, 2000; and pending US application no. 08/888,950, filed July 7, 1997.

By "structurally related mycotoxin" is meant any mycotoxin having a chemical
structure related to a fumonisin or AP1 such as AAL toxin, fumonisin B1, fumonisin B2,
fumonisin B3, fumonisin B4, fumonisin C1, fumonisin A1 and A2, and their analogs or
hydrolyzed forms, as well as other mycotoxins having similar chemical structures,
including synthetically made analogs that contain a C-2 or C-1 amine group and one or
more adjacent hydroxyl groups, that would be expected to be degraded by the activity of an
enzyme of the present invention. The enzyme of the present invention is the first flavin
amine oxidase known to attack a primary amine not located at C-1 (i.e. C-2 of AP1),
resulting in a keto rather than an aldehydic product.

It is understood that "AP1" or "amino polyol" as used here is to designate the
hydrolyzed form of any fumonisin, FB1, FB2, FB3, FB4, AAL, or any other AP1-like
compound, including a compound made synthetically, that contains a C-2 or C-1 amine
group and one or more adjacent hydroxyl groups.

By "amplified" is meant the construction of multiple copies of a nucleic acid
sequence or multiple copies complementary to the nucleic acid sequence using at least one
of the nucleic acid sequences as a template. Amplification systems include the polymerase
chain reaction (PCR) system, ligase chain reaction (LCR) system, nucleic acid sequence
based amplification (NASBA, Cangene, Mississauga, Ontario), Q-Beta Replicase systems,
transcription-based amplification system (TAS), and strand displacement amplification
(SDA). See, e.g., Diagnostic Molecular Microbiology: Principles and Applications, D. H.
Persing et al., Ed., American Society for Microbiology, Washington, DC (1993). The
product of amplification is termed an amplicon.

The term "conservatively modified variants" applies to both amino acid and nucleic
acid sequences. With respect to particular nucleic acid sequences, conservatively modified
variants refer to those nucleic acids that encode identical or conservatively modified
variants of the amino acid sequences. Because of the degeneracy of the genetic code, a
large number of functionally identical nucleic acids encode any given protein. For
instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus,
at every position where an alanine is specified by a codon, the codon can be altered to any
of the corresponding codons described without altering the encoded polypeptide. Such
nucleic acid variations are "silent variations" and represent one species of conservatively
modified variation. Every nucleic acid sequence herein that encodes a polypeptide also
describes every possible silent variation of the nucleic acid. One of ordinary skill will
recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only
codon for methionine; one exception is Micrococcus rubens, for which GTG is the
methionine codon (Ishizuka, et al., J Gen Microbiol, 139:425-432 (1993))) can be
modified to yield a functionally identical molecule. Accordingly, each silent variation of a
nucleic acid, which encodes a polypeptide of the present invention, is implicit in each
described polypeptide sequence and incorporated herein by reference.
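
The degeneracy described above can be illustrated with a minimal Python sketch. It assumes the standard ("universal") genetic code written in DNA letters (T in place of U); the function names and the short test sequence are hypothetical and are not taken from any sequence of this specification.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (DNA alphabet).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding DNA string codon by codon ('*' marks a stop)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def synonymous_codons(codon):
    """Return every codon encoding the same amino acid as `codon`."""
    aa = CODON_TABLE[codon]
    return sorted(c for c, a in CODON_TABLE.items() if a == aa)


def count_silent_variants(dna):
    """Count the DNA sequences (including `dna` itself) encoding the same protein."""
    total = 1
    usable = len(dna) - len(dna) % 3
    for i in range(0, usable, 3):
        total *= len(synonymous_codons(dna[i:i + 3]))
    return total


if __name__ == "__main__":
    print(synonymous_codons("GCA"))            # the four alanine codons
    print(count_silent_variants("ATGGCATGG"))  # Met-Ala-Trp
```

Under this code table, methionine (ATG) and tryptophan (TGG) each have a single codon, so only the alanine position of the toy sequence can vary silently.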

As to amino acid sequences, one of skill will recognize that individual
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein
sequence which alter, add or delete a single amino acid or a small percentage of amino
acids in the encoded sequence are "conservatively modified variants" when the alteration
results in the substitution of an amino acid with a chemically similar amino acid. Thus,
any number of amino acid residues selected from the group of integers consisting of from 1
to 15 can be so altered. For example, 1, 2, 3, 4, 5, 7, or 10 alterations can be made.
Conservatively modified variants typically provide similar biological activity as the
unmodified polypeptide sequence from which they are derived. For example, substrate
specificity, enzyme activity, or ligand/receptor binding is generally at least 30%, 40%,
50%, 60%, 70%, 80%, or 90%, preferably 60-90%, of the native protein for its native
substrate. Conservative substitution tables providing functionally similar amino acids are
well known in the art.

The following six groups each contain amino acids that are conservative
substitutions for one another:

1) Alanine (A), Serine (S), Threonine (T);
2) Aspartic acid (D), Glutamic acid (E);
3) Asparagine (N), Glutamine (Q);
4) Arginine (R), Lysine (K);
5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and
6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).

See also, Creighton (1984) Proteins W.H. Freeman and Company.
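
As a rough illustration of how the six groups listed above might be applied, the following Python sketch classifies each difference between two aligned, equal-length amino acid sequences as conservative or not. The function names and the two short peptides are hypothetical examples, and the sketch does not handle insertions or deletions.

```python
# The six conservative substitution groups, in one-letter amino acid symbols.
CONSERVATIVE_GROUPS = [
    set("AST"),   # 1) Alanine, Serine, Threonine
    set("DE"),    # 2) Aspartic acid, Glutamic acid
    set("NQ"),    # 3) Asparagine, Glutamine
    set("RK"),    # 4) Arginine, Lysine
    set("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
]


def is_conservative(original, replacement):
    """True when two different residues fall in the same group."""
    if original == replacement:
        return False
    return any(original in g and replacement in g for g in CONSERVATIVE_GROUPS)


def classify_substitutions(reference, variant):
    """List (1-based position, reference residue, variant residue, conservative?)."""
    if len(reference) != len(variant):
        raise ValueError("sequences must be aligned and of equal length")
    return [
        (pos, ref, var, is_conservative(ref, var))
        for pos, (ref, var) in enumerate(zip(reference, variant), start=1)
        if ref != var
    ]


if __name__ == "__main__":
    for change in classify_substitutions("MKAL", "MRAG"):
        print(change)
```

Residues that appear in none of the six groups (for example glycine or cysteine) are never reported as conservative replacements by this sketch.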

As used herein, "consisting essentially of" means the inclusion of additional
sequences to an object polynucleotide where the additional sequences do not selectively
hybridize, under stringent hybridization conditions, to the same cDNA as the
polynucleotide and where the hybridization conditions include a wash step in 0.1X SSC
and 0.1% sodium dodecyl sulfate at 65 °C.

By "encoding" or "encoded", with respect to a specified nucleic acid, is meant
comprising the information for translation into the specified protein. A nucleic acid
encoding a protein may comprise non-translated sequences (e.g., introns) within translated
regions of the nucleic acid, or may lack such intervening non-translated sequences (e.g., as
in cDNA). The information by which a protein is encoded is specified by the use of
codons. Typically, the amino acid sequence is encoded by the nucleic acid using the
"universal" genetic code. However, variants of the universal code, such as are present in
some plant, animal, and fungal mitochondria, the bacterium Mycoplasma capricolum
(Proc. Natl Acad Sci (USA), 82: 2306-2309 (1985)), or the ciliate Macronucleus, may be
used when the nucleic acid is expressed using these organisms.

When the nucleic acid is prepared or altered synthetically, advantage can be taken
of known codon preferences of the intended host where the nucleic acid is to be expressed.
For example, although nucleic acid sequences of the present invention may be expressed in
both monocotyledonous and dicotyledonous plant species, sequences can be modified to
account for the specific codon preferences and GC content preferences of
monocotyledonous plants or dicotyledonous plants as these preferences have been shown
to differ (Murray et al., Nucl. Acids Res. 17: 477-498 (1989), herein incorporated by
reference). Thus, the maize preferred codon for a particular amino acid might be derived
from known gene sequences from maize. Maize codon usage for 28 genes from maize
plants is listed in Table 4 of Murray et al., supra.
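
One way to derive a preferred codon per amino acid from known gene sequences, as described above, is simply to tally codon usage across a set of coding sequences. The Python sketch below does this under stated assumptions: the standard genetic code is used, the two input sequences are invented toy examples (not maize genes and not data from Murray et al.), and the function names are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (DNA alphabet).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def preferred_codons(coding_sequences):
    """Return {amino acid: most frequently used codon} for the given sequences."""
    usage = defaultdict(Counter)
    for cds in coding_sequences:
        for i in range(0, len(cds) - len(cds) % 3, 3):
            codon = cds[i:i + 3]
            usage[CODON_TABLE[codon]][codon] += 1
    return {aa: counts.most_common(1)[0][0] for aa, counts in usage.items()}


def recode(protein, preference):
    """Back-translate a protein using the preferred codon for each residue."""
    return "".join(preference[aa] for aa in protein)


if __name__ == "__main__":
    # Toy coding sequences, purely illustrative.
    toy_genes = ["ATGGCCGCCGCGTAA", "ATGGCCAAGAAGAAATAA"]
    preference = preferred_codons(toy_genes)
    print(preference["A"], preference["K"])
    print(recode("MAK", preference))
```

A practical codon table would be built from many genes of the intended host, and an amino acid absent from the input set has no entry in the resulting dictionary.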

As used herein, "heterologous" in reference to a nucleic acid is a nucleic acid that
originates from a foreign species, or, if from the same species, is substantially modified
from its native form in composition and/or genomic locus by deliberate human
intervention. For example, a promoter operably linked to a heterologous structural gene is
from a species different from that from which the structural gene was derived, or, if from
the same species, one or both are substantially modified from their original form. A
heterologous protein may originate from a foreign species or, if from the same species, is
substantially modified from its original form by deliberate human intervention.

By "host cell" or "recombinantly engineered cell" is meant a cell that contains a
vector and supports the replication and/or expression of the expression vector. Host cells
may be prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, Pichia, insect,
plant, amphibian, or mammalian cells. Preferably, host cells are monocotyledonous or
dicotyledonous plant cells, including but not limited to maize, sorghum, sunflower,
soybean, wheat, alfalfa, rice, cotton, canola, barley, millet, and tomato. A particularly
preferred monocotyledonous host cell is a maize host cell.

The term "hybridization complex" includes reference to a duplex nucleic acid 
structure formed by two single-stranded nucleic acid sequences selectively hybridized with 
each other. 

The term "introduced" in the context of inserting a nucleic acid into a cell, means
"transfection" or "transformation" or "transduction" and includes reference to the
incorporation of a nucleic acid into a eukaryotic or prokaryotic cell where the nucleic acid
may be incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid or
mitochondrial DNA), converted into an autonomous replicon, or transiently expressed
(e.g., transfected mRNA).

The term "isolated" refers to material, such as a nucleic acid or a protein, which is
substantially or essentially free from components which normally accompany or interact
with it as found in its naturally occurring environment. The isolated material optionally
comprises material not found with the material in its natural environment. Nucleic acids
which are "isolated", as defined herein, are also referred to as "heterologous" nucleic acids.

Unless otherwise stated, the term "APAO nucleic acid" means a nucleic acid
comprising a polynucleotide ("APAO polynucleotide") encoding an APAO polypeptide.
The term APAO, unless otherwise stated, can encompass both APAO and the functional,
truncated version of APAO designated trAPAO.

As used herein, "nucleic acid" includes reference to a deoxyribonucleotide or
ribonucleotide polymer in either single- or double-stranded form, and unless otherwise
limited, encompasses known analogues having the essential nature of natural nucleotides
in that they hybridize to single-stranded nucleic acids in a manner similar to naturally
occurring nucleotides (e.g., peptide nucleic acids).

By "nucleic acid library" is meant a collection of isolated DNA or RNA molecules
which comprise and substantially represent the entire transcribed fraction of a genome of a
specified organism. Construction of exemplary nucleic acid libraries, such as genomic and
cDNA libraries, is taught in standard molecular biology references such as Berger and
Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology, Vol. 152,
Academic Press, Inc., San Diego, CA (Berger); Sambrook et al., Molecular Cloning - A
Laboratory Manual, 2nd ed., Vol. 1-3 (1989); and Current Protocols in Molecular
Biology, F.M. Ausubel et al., Eds., Current Protocols, a joint venture between Greene
Publishing Associates, Inc. and John Wiley & Sons, Inc. (1994 Supplement).

As used herein, "operably linked" includes reference to a functional linkage between
a promoter and a second sequence, wherein the promoter sequence initiates and mediates
transcription of the DNA sequence corresponding to the second sequence. Generally,
operably linked means that the nucleic acid sequences being linked are contiguous and,
where necessary to join two protein coding regions, contiguous and in the same reading
frame.

The term "ligated" or "ligation" refers to the process of forming phosphodiester 
bonds between two or more polynucleotides, which most often are double stranded DNAs. 
Techniques for ligation are well known in the art and protocols are described in standard
laboratory manuals and references, such as Sambrook, et al., Molecular Cloning: A
Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New
York (1989). The two polynucleotides can include, but are not limited to, a 
polynucleotide, which can function as a promoter, ligated to a polynucleotide capable of 
encoding a polypeptide or linking two polynucleotides each capable of encoding a
polypeptide. In the case of joining two polynucleotides that each encode a polypeptide, a
polynucleotide spacer region between the two polynucleotides may or may not be present. 
The spacer region may encode a polypeptide containing a protease cleavage site. 
Optionally, the spacer region may contain a polynucleotide cleavage site such as but not 
limited to a site for RNAse cleavage or a self-cleaving ribozyme (See, e.g., Tanner, FEMS
Microbiol Rev, 23(3):257-75 (1999)). Alternatively, the transcription of the two or more
ligated polynucleotides may result in a polycistronic message. An example of a spacer 
sequence that would direct translation of downstream coding sequences is an intervening 
ribosomal entry site (IRES) (See, e.g., Liu, et al., Anal Biochem, 280(1):20-28 (2000)).
The spacer region may be of any length that results in a functional
polypeptide or polypeptides. For example, the spacer region may be from 1 nucleotide to
1000 nucleotides, preferably 24 nucleotides in length.

As used herein, the term "plant" includes reference to whole plants, plant organs
(e.g., leaves, stems, roots, etc.), seeds and plant cells and progeny of same. Plant cell, as
used herein includes, without limitation, seeds, suspension cultures, embryos, meristematic
regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and
microspores. The class of plants, which can be used in the methods of the invention, is
generally as broad as the class of higher plants amenable to transformation techniques, 
including both monocotyledonous and dicotyledonous plants including species from the 
genera: Cucurbita, Rosa, Vitis, Juglans, Fragaria, Lotus, Medicago, Onobrychis, 
Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis,
Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersicon,
Nicotiana, Solanum, Petunia, Digitalis, Majorana, Cichorium, Helianthus, Lactuca,
Bromus, Asparagus, Antirrhinum, Hemerocallis, Nemesia, Pelargonium, Panicum,
Pennisetum, Ranunculus, Senecio, Salpiglossis, Cucumis, Browallia, Glycine, Pisum,
Phaseolus, Lolium, Oryza, Avena, Hordeum, Secale, Allium, and Triticum. A particularly
preferred plant is Zea mays.

As used herein, "polynucleotide" includes reference to a deoxyribopolynucleotide, 
ribopolynucleotide, or analogs thereof that have the essential nature of a natural 
ribonucleotide in that they hybridize, under stringent hybridization conditions, to 
substantially the same nucleotide sequence as naturally occurring nucleotides and/or allow
translation into the same amino acid(s) as the naturally occurring nucleotide(s). A
polynucleotide can be full-length or a subsequence of a native or heterologous structural or 
regulatory gene. Unless otherwise indicated, the term includes reference to the specified 
sequence as well as the complementary sequence thereof. Thus, DNAs or RNAs with 
backbones modified for stability or for other reasons are "polynucleotides" as that term is
intended herein. Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or
modified bases, such as tritylated bases, to name just two examples, are polynucleotides as 
the term is used herein. It will be appreciated that a great variety of modifications have been 
made to DNA and RNA that serve many useful purposes known to those of skill in the art. 
The term polynucleotide as it is employed herein embraces such chemically, enzymatically
or metabolically modified forms of polynucleotides, as well as the chemical forms of DNA
and RNA characteristic of viruses and cells, including inter alia, simple and complex cells.

The terms "polypeptide", "peptide" and "protein" are used interchangeably herein
to refer to a polymer of amino acid residues. The terms apply to amino acid polymers in
which one or more amino acid residue is an artificial chemical analogue of a corresponding
naturally occurring amino acid, as well as to naturally occurring amino acid polymers.

As used herein "promoter" includes reference to a region of DNA upstream from
the start of transcription and involved in recognition and binding of RNA polymerase and
other proteins to initiate transcription. A "plant promoter" is a promoter capable of 
initiating transcription in plant cells. Exemplary plant promoters include, but are not 
limited to, those that are obtained from plants, plant viruses, and bacteria which comprise
genes expressed in plant cells such as Agrobacterium or Rhizobium. Examples are promoters
that preferentially initiate transcription in certain tissues, such as leaves, roots, seeds,
fibres, xylem vessels, tracheids, or sclerenchyma. Such promoters are referred to as "tissue
preferred". A "cell type" specific promoter primarily drives expression in certain cell types
in one or more organs, for example, vascular cells in roots or leaves. An "inducible" or
"regulatable" promoter is a promoter, which is under environmental control. Examples of
environmental conditions that may affect transcription by inducible promoters include
anaerobic conditions or the presence of light. Another type of promoter is a
developmentally regulated promoter, for example, a promoter that drives expression during
pollen development. Tissue preferred, cell type specific, developmentally regulated, and
inducible promoters constitute the class of "non-constitutive" promoters. A "constitutive"
promoter is a promoter, which is active under most environmental conditions.

The term "APAO polypeptide or trAPAO polypeptide" refers to one or more amino 
acid sequences. The term is also inclusive of fragments, variants, homologs, alleles or 
precursors (e.g., preproproteins or proproteins) thereof. An "APAO or trAPAO protein"
comprises an APAO or trAPAO polypeptide.

As used herein "recombinant" includes reference to a cell or vector, that has been 
modified by the introduction of a heterologous nucleic acid or that the cell is derived from 
a cell so modified. Thus, for example, recombinant cells express genes that are not found 
in identical form within the native (non-recombinant) form of the cell or express native
genes that are otherwise abnormally expressed, under expressed or not expressed at all as a
result of deliberate human intervention. The term "recombinant" as used herein does not
encompass the alteration of the cell or vector by naturally occurring events (e.g.,
spontaneous mutation, natural transformation/transduction/transposition) such as those
occurring without deliberate human intervention.

As used herein, a "recombinant expression cassette" is a nucleic acid construct, 
generated recombinantly or synthetically, with a series of specified nucleic acid elements, 
which permit transcription of a particular nucleic acid in a target cell. The recombinant 
expression cassette can be incorporated into a plasmid, chromosome, mitochondrial DNA, 
plastid DNA, virus, or nucleic acid fragment. Typically, the recombinant expression 
cassette portion of an expression vector includes, among other sequences, a nucleic acid to 
be transcribed, and a promoter. 

The terms "residue" or "amino acid residue" or "amino acid" are used
interchangeably herein to refer to an amino acid that is incorporated into a protein, 
polypeptide, or peptide (collectively "protein"). The amino acid may be a naturally 
occurring amino acid and, unless otherwise limited, may encompass known analogs of 
natural amino acids that can function in a similar manner as naturally occurring amino 
acids. 

The term "selectively hybridizes" includes reference to hybridization, under 
stringent hybridization conditions, of a nucleic acid sequence to a specified nucleic acid 
target sequence to a detectably greater degree (e.g., at least 2-fold over background) than 
its hybridization to non-target nucleic acid sequences and to the substantial exclusion of 
non-target nucleic acids. Selectively hybridizing sequences typically have about at least 
40% sequence identity, preferably 60-90% sequence identity, and most preferably 100% 
sequence identity (i.e., complementary) with each other. 

The terms "stringent conditions" or "stringent hybridization conditions" include 
reference to conditions under which a probe will hybridize to its target sequence, to a 
detectably greater degree than other sequences (e.g., at least 2-fold over background). 
Stringent conditions are sequence-dependent and will be different in different 
circumstances. By controlling the stringency of the hybridization and/or washing 
conditions, target sequences can be identified which can be up to 100% complementary to 
the probe (homologous probing). Alternatively, stringency conditions can be adjusted to 
allow some mismatching in sequences so that lower degrees of similarity are detected 
(heterologous probing). Optimally, the probe is approximately 500 nucleotides in length,
but can vary greatly in length from less than 500 nucleotides to equal to the entire length of
the target sequence.

Typically, stringent conditions will be those in which the salt concentration is less
than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other
salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10
to 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing
agents such as formamide or Denhardt's. Exemplary low stringency conditions include
hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCl, 1% SDS (sodium
dodecyl sulphate) at 37°C, and a wash in 1X to 2X SSC (20X SSC = 3.0 M NaCl/0.3 M
trisodium citrate) at 50 to 55°C. Exemplary moderate stringency conditions include
hybridization in 40 to 45% formamide, 1 M NaCl, 1% SDS at 37°C, and a wash in 0.5X to
1X SSC at 55 to 60°C. Exemplary high stringency conditions include hybridization in
50% formamide, 1 M NaCl, 1% SDS at 37°C, and a wash in 0.1X SSC at 60 to 65°C.

Specificity is typically the function of post-hybridization washes, the critical factors being
the ionic strength and temperature of the final wash solution. For DNA-DNA hybrids, the
Tm can be approximated from the equation of Meinkoth and Wahl, Anal. Biochem.,
138:267-284 (1984): Tm = 81.5 °C + 16.6 (log M) + 0.41 (%GC) - 0.61 (% form) - 500/L;
where M is the molarity of monovalent cations, %GC is the percentage of guanosine and
cytosine nucleotides in the DNA, % form is the percentage of formamide in the
hybridization solution, and L is the length of the hybrid in base pairs. The Tm is the
temperature (under defined ionic strength and pH) at which 50% of a complementary
target sequence hybridizes to a perfectly matched probe. Tm is reduced by about 1 °C for
each 1% of mismatching; thus, Tm, hybridization and/or wash conditions can be adjusted to
hybridize to sequences of the desired identity. For example, if sequences with >90%
identity are sought, the Tm can be decreased 10 °C. Generally, stringent conditions are
selected to be about 5 °C lower than the thermal melting point (Tm) for the specific
sequence and its complement at a defined ionic strength and pH. However, severely
stringent conditions can utilize a hybridization and/or wash at 1, 2, 3, or 4 °C lower than
the thermal melting point (Tm); moderately stringent conditions can utilize a hybridization
and/or wash at 6, 7, 8, 9, or 10 °C lower than the thermal melting point (Tm); low
stringency conditions can utilize a hybridization and/or wash at 11, 12, 13, 14, 15, or 20 °C
lower than the thermal melting point (Tm). Using the equation, hybridization and wash
compositions, and desired Tm, those of ordinary skill will understand that variations in the
stringency of hybridization and/or wash solutions are inherently described. If the desired
degree of mismatching results in a Tm of less than 45 °C (aqueous solution) or 32 °C
(formamide solution) it is preferred to increase the SSC concentration so that a higher
temperature can be used. An extensive guide to the hybridization of nucleic acids is found
in Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization
with Nucleic Acid Probes, Part I, Chapter 2 "Overview of principles of hybridization and
the strategy of nucleic acid probe assays", Elsevier, New York (1993); and Current
Protocols in Molecular Biology, Chapter 2, Ausubel, et al., Eds., Greene Publishing and
Wiley-Interscience, New York (1995). Unless otherwise stated, in the present application
high stringency is defined as hybridization in 4X SSC, 5X Denhardt's (5 g Ficoll, 5 g
polyvinylpyrrolidone, 5 g bovine serum albumin in 500 ml of water), 0.1 mg/ml boiled
salmon sperm DNA, and 25 mM Na phosphate at 65°C, and a wash in 0.1X SSC, 0.1%
SDS at 65°C.
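
By way of illustration only, the melting temperature approximation of Meinkoth and Wahl recited above, together with the rule of thumb of about 1 °C reduction per 1% of mismatching, can be sketched as follows. The function names and the example input values (1 M monovalent cation, 50% GC, 50% formamide, a 500 base pair hybrid) are assumptions chosen for the example and are not taken from the specification.

```python
import math


def melting_temperature(molarity, percent_gc, percent_formamide, length_bp):
    """Approximate melting temperature (deg C) of a DNA-DNA hybrid.

    Implements 81.5 + 16.6*log10(M) + 0.41*(%GC) - 0.61*(%form) - 500/L.
    """
    return (81.5
            + 16.6 * math.log10(molarity)
            + 0.41 * percent_gc
            - 0.61 * percent_formamide
            - 500.0 / length_bp)


def mismatch_adjusted_tm(tm, percent_identity):
    """Lower the melting temperature by about 1 deg C per 1% of mismatching."""
    return tm - (100.0 - percent_identity)


# Hypothetical example values (assumptions for illustration only).
tm = melting_temperature(molarity=1.0, percent_gc=50.0,
                         percent_formamide=50.0, length_bp=500)
tm_90 = mismatch_adjusted_tm(tm, percent_identity=90.0)
print(f"Perfect match: {tm:.1f} C; 90% identity: {tm_90:.1f} C")
```

A wash temperature for a desired stringency could then be chosen by subtracting the number of degrees recited above (for example, about 5 °C for generally stringent conditions) from the adjusted value.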

As used herein, "transgenic plant" includes reference to a plant, which comprises 
within its genome a heterologous polynucleotide. Generally, the heterologous 
polynucleotide is stably integrated within the genome such that the polynucleotide is 
passed on to successive generations. The heterologous polynucleotide may be integrated 
into the genome alone or as part of a recombinant expression cassette. "Transgenic" is used 
herein to include any cell, cell line, callus, tissue, plant part or plant, the genotype of which 
has been altered by the presence of heterologous nucleic acid including those transgenics 
initially so altered as well as those created by sexual crosses or asexual propagation from 
the initial transgenic. The term "transgenic" as used herein does not encompass the 
alteration of the genome (chromosomal or extra-chromosomal) by conventional plant 
breeding methods or by naturally occurring events such as random cross-fertilization, non- 
recombinant viral infection, non-recombinant bacterial transformation, non-recombinant 
transposition, or spontaneous mutation. 

As used herein, "vector" includes reference to a nucleic acid used in transfection of 
a host cell and into which can be inserted a polynucleotide. Vectors are often replicons. 
Expression vectors permit transcription of a nucleic acid inserted therein.

The following terms are used to describe the sequence relationships between two or
more nucleic acids or polynucleotides or polypeptides: (a) "reference sequence", (b) 
"comparison window", (c) "sequence identity", (d) "percentage of sequence identity", and 
(e) "substantial identity". 

(a) As used herein, "reference sequence" is a defined sequence used as a basis for 
sequence comparison. A reference sequence may be a subset or the entirety of a specified 
sequence; for example, as a segment of a full-length cDNA or gene sequence, or the 
complete cDNA or gene sequence. 

(b) As used herein, "comparison window" includes reference to a contiguous
and specified segment of a polynucleotide sequence, wherein the polynucleotide sequence 
may be compared to a reference sequence and wherein the portion of the polynucleotide 
sequence in the comparison window may comprise additions or deletions (i.e., gaps) 
compared to the reference sequence (which does not comprise additions or deletions) for 
optimal alignment of the two sequences. Generally, the comparison window is at least 20 
contiguous nucleotides in length, and optionally can be 30, 40, 50, 100, or longer. Those 
of skill in the art understand that to avoid a high similarity to a reference sequence due to 
inclusion of gaps in the polynucleotide sequence a gap penalty is typically introduced and 
is subtracted from the number of matches. 

Methods of alignment of nucleotide and amino acid sequences for comparison are 
well known in the art. Optimal alignment of sequences for comparison may be conducted
by the local homology algorithm (Best Fit) of Smith and Waterman, Adv. Appl. Math. 2: 482
(1981); by the homology alignment algorithm (GAP) of Needleman and Wunsch, J. Mol.
Biol. 48: 443 (1970); by the search for similarity method (Tfasta and Fasta) of Pearson and
Lipman, Proc. Natl. Acad. Sci. 85: 2444 (1988); by computerized implementations of these
algorithms, including, but not limited to: CLUSTAL in the PC/Gene program by
Intelligenetics, Mountain View, California, GAP, BESTFIT, BLAST, FASTA, and
TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG),
575 Science Dr., Madison, Wisconsin, USA; the CLUSTAL program is well described by
Higgins and Sharp, Gene 73: 237-244 (1988); Higgins and Sharp, CABIOS 5: 151-153
(1989); Corpet, et al., Nucleic Acids Research 16: 10881-90 (1988); Huang, et al.,
Computer Applications in the Biosciences 8: 155-65 (1992), and Pearson, et al., Methods
in Molecular Biology 24: 307-331 (1994). The preferred program to use for optimal
global alignment of multiple sequences is PileUp (Feng and Doolittle, Journal of
Molecular Evolution, 25:351-360 (1987), which is similar to the method described by
Higgins and Sharp, CABIOS, 5:151-153 (1989), and hereby incorporated by reference). The
BLAST family of programs which can be used for database similarity searches includes: 
BLASTN for nucleotide query sequences against nucleotide database sequences; BLASTX 
for nucleotide query sequences against protein database sequences; BLASTP for protein 
query sequences against protein database sequences; TBLASTN for protein query 
sequences against nucleotide database sequences; and TBLASTX for nucleotide query 
sequences against nucleotide database sequences. See, Current Protocols in Molecular 
Biology, Chapter 19, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New
York (1995). 

GAP uses the algorithm of Needleman and Wunsch (J. Mol. Biol. 48: 443-453, 
1970) to find the alignment of two complete sequences that maximizes the number of 
matches and minimizes the number of gaps. GAP considers all possible alignments and 
gap positions and creates the alignment with the largest number of matched bases and the 
fewest gaps. It allows for the provision of a gap creation penalty and a gap extension 
penalty in units of matched bases. GAP must make a profit of gap creation penalty number 
of matches for each gap it inserts. If a gap extension penalty greater than zero is chosen, 
GAP must, in addition, make a profit for each gap inserted of the length of the gap times 
the gap extension penalty. Default gap creation penalty values and gap extension penalty 
values in Version 10 of the Wisconsin Genetics Software Package are 8 and 2, 
respectively. The gap creation and gap extension penalties can be expressed as an integer 
selected from the group of integers consisting of from 0 to 100. Thus, for example, the gap 
creation and gap extension penalties can be 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30, 40, 50, 
or greater. 
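
For illustration, a global alignment in the manner of Needleman and Wunsch can be sketched as below. This is a simplified sketch and not the GAP program itself: it assumes a single linear gap penalty per gap position rather than separate gap creation and gap extension penalties, and the function name and the default scores (match 1, mismatch -1, gap -2) are hypothetical values chosen for the example.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Global alignment of sequences a and b with a linear gap penalty.

    Returns (score, aligned_a, aligned_b), with '-' marking gap positions.
    """
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + pair,  # align two symbols
                              score[i - 1][j] + gap,       # gap in b
                              score[i][j - 1] + gap)       # gap in a

    # Trace back from the bottom-right cell to recover one best alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        pair = match if (i > 0 and j > 0 and a[i - 1] == b[j - 1]) else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + pair:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))


best_score, top, bottom = needleman_wunsch("ACGT", "AGT")
print(best_score)
print(top)
print(bottom)
```

As with GAP, the traceback returns only one member of the family of best alignments; other alignments with the same score may exist.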

GAP presents one member of the family of best alignments. There may be many 
members of this family, but no other member has a better quality. GAP displays four 
figures of merit for alignments: Quality, Ratio, Identity, and Similarity. The Quality is the 
metric maximized in order to align the sequences. Ratio is the quality divided by the 
number of bases in the shorter segment. Percent Identity is the percent of the symbols that 
actually match. Percent Similarity is the percent of the symbols that are similar. Symbols 
that are across from gaps are ignored. A similarity is scored when the scoring matrix value
for a pair of symbols is greater than or equal to 0.50, the similarity threshold. The scoring
matrix used in Version 10 of the Wisconsin Genetics Software Package is BLOSUM62
(see Henikoff & Henikoff (1992) Proc. Natl. Acad. Sci. USA 89:10915).

Unless otherwise stated, sequence identity/similarity values provided herein refer to
the value obtained using the BLAST 2.0 suite of programs using default parameters.
Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997).

As those of ordinary skill in the art will understand, BLAST searches assume that 
proteins can be modeled as random sequences. However, many real proteins comprise 
regions of nonrandom sequences, which may be homopolymeric tracts, short-period
repeats, or regions enriched in one or more amino acids. Such low-complexity regions
may be aligned between unrelated proteins even though other regions of the protein are
entirely dissimilar. A number of low-complexity filter programs can be employed to
reduce such low-complexity alignments. For example, the SEG (Wootton and Federhen,
Comput. Chem., 17:149-163 (1993)) and XNU (Claverie and States, Comput. Chem.,
17:191-201 (1993)) low-complexity filters can be employed alone or in combination.

(c) As used herein, "sequence identity" or "identity" in the context of two nucleic 
acid or polypeptide sequences includes reference to the residues in the two sequences, 
which are the same when aligned for maximum correspondence over a specified 
comparison window. When percentage of sequence identity is used in reference to
proteins it is recognized that residue positions which are not identical often differ by
conservative amino acid substitutions, where amino acid residues are substituted for other 
amino acid residues with similar chemical properties (e.g. charge or hydrophobicity) and 
therefore do not change the functional properties of the molecule. Where sequences differ 
in conservative substitutions, the percent sequence identity may be adjusted upwards to
correct for the conservative nature of the substitution. Sequences, which differ by such
conservative substitutions, are said to have "sequence similarity" or "similarity". Means 
for making this adjustment are well known to those of skill in the art. Typically this 
involves scoring a conservative substitution as a partial rather than a full mismatch, 
thereby increasing the percentage sequence identity. Thus, for example, where an identical
amino acid is given a score of 1 and a non-conservative substitution is given a score of
zero, a conservative substitution is given a score between zero and 1. The scoring of
conservative substitutions is calculated, e.g., according to the algorithm of Meyers and
Miller, Computer Applic. Biol. Sci., 4: 11-17 (1988), e.g., as implemented in the program
PC/GENE (Intelligenetics, Mountain View, California, USA).
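
The partial-credit scoring described above can be illustrated with the following sketch. It is not the algorithm of Meyers and Miller; the grouping of chemically similar amino acids, the partial score of 0.5, and the function names are all assumptions made for the purpose of the example, and the two sequences are assumed to be already aligned, ungapped, and of equal length.

```python
# Hypothetical groups of chemically similar residues (an assumption for
# illustration; real scoring schemes derive such values from a matrix).
SIMILAR_GROUPS = ["ILVM", "FWY", "KRH", "DE", "ST", "NQ"]


def residue_score(a, b, conservative_score=0.5):
    """Score 1 for identity, a partial score for a conservative change, else 0."""
    if a == b:
        return 1.0
    for group in SIMILAR_GROUPS:
        if a in group and b in group:
            return conservative_score
    return 0.0


def adjusted_percent_identity(seq1, seq2, conservative_score=0.5):
    """Percent identity adjusted upward for conservative substitutions."""
    if len(seq1) != len(seq2) or not seq1:
        raise ValueError("sequences must be non-empty and of equal length")
    total = sum(residue_score(a, b, conservative_score)
                for a, b in zip(seq1, seq2))
    return 100.0 * total / len(seq1)


# Strict identity (no partial credit) versus the adjusted value.
print(adjusted_percent_identity("LKDS", "IKEA", conservative_score=0.0))
print(adjusted_percent_identity("LKDS", "IKEA"))
```

In this example the pairs L/I and D/E are treated as conservative substitutions, so the adjusted value is higher than the strict identity value.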

(d) As used herein, "percentage of sequence identity" means the value determined 
by comparing two optimally aligned sequences over a comparison window, wherein the
portion of the polynucleotide sequence in the comparison window may comprise additions
or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise 
additions or deletions) for optimal alignment of the two sequences. The percentage is 
calculated by determining the number of positions at which the identical nucleic acid base 
or amino acid residue occurs in both sequences to yield the number of matched positions,
dividing the number of matched positions by the total number of positions in the window
of comparison and multiplying the result by 100 to yield the percentage of sequence 
identity. 
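
The calculation just described can be sketched as follows, assuming the two sequences have already been optimally aligned over the comparison window and are supplied as equal-length strings in which "-" marks a gap. The function name and the example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percentage of sequence identity over an aligned comparison window.

    Counts positions where both sequences carry the same base or residue,
    divides by the total number of positions in the window, and
    multiplies by 100. Gap positions never count as matches.
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("aligned sequences must be non-empty and equal length")
    matched = sum(1 for x, y in zip(aligned_a, aligned_b)
                  if x == y and x != "-")
    return 100.0 * matched / len(aligned_a)


# Three matched positions out of four positions in the window.
print(percent_identity("ACGT", "A-GT"))
```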

(e) (i) The term "substantial identity" of polynucleotide sequences means that a 
polynucleotide comprises a sequence that has between 50-100% sequence identity,
preferably at least 50% sequence identity, preferably at least 60% sequence identity,
preferably at least 70%, more preferably at least 80%, more preferably at least 90% and 
most preferably at least 95%, compared to a reference sequence using one of the alignment 
programs described using standard parameters. One of skill will recognize that these 
values can be appropriately adjusted to determine corresponding identity of proteins
encoded by two nucleotide sequences by taking into account codon degeneracy, amino
acid similarity, reading frame positioning and the like. Substantial identity of amino acid 
sequences for these purposes normally means sequence identity of between 40-100%, 
preferably at least 55%, preferably at least 60%, more preferably at least 70%, 80%, 90%, 
and most preferably at least 95%. 

Another indication that nucleotide sequences are substantially identical is if two
molecules hybridize to each other under stringent conditions. The degeneracy of the
genetic code allows for many nucleotide substitutions that lead to variety in the
nucleotide sequences that code for the same amino acid; hence it is possible that two DNA
sequences could code for the same polypeptide but not hybridize to each other under
stringent conditions. This may occur, e.g., when a copy of a nucleic acid is created using
the maximum codon degeneracy permitted by the genetic code. One indication that two
nucleic acid sequences are substantially identical is that the polypeptide, which the first
nucleic acid encodes, is immunologically cross reactive with the polypeptide encoded by
the second nucleic acid.

(e) (ii) The term "substantial identity" in the context of a peptide indicates that a
peptide comprises a sequence with between 55-100% sequence identity to a reference
sequence, preferably at least 55% sequence identity, preferably 60%, preferably 70%, more
preferably 80%, most preferably at least 90% or 95% sequence identity to the reference
sequence over a specified comparison window. Preferably, optimal alignment is
conducted using the homology alignment algorithm of Needleman and Wunsch, J. Mol.
Biol. 48: 443 (1970). An indication that two peptide sequences are substantially identical
is that one peptide is immunologically reactive with antibodies raised against the second 
peptide. Thus, a peptide is substantially identical to a second peptide, for example, where 
the two peptides differ only by a conservative substitution. In addition, a peptide can be 
substantially identical to a second peptide when they differ by a non-conservative change if 
the epitope that the antibody recognizes is substantially identical. Peptides which are
"substantially similar" share sequences as noted above, except that residue positions
which are not identical may differ by conservative amino acid changes.

Fumonisin Degrading Organisms 

The present invention is based on the discovery of organisms with the ability to 
degrade the mycotoxin fumonisin. In a search for a biological means of detoxifying 
fumonisins, several dematiaceous hyphomycetes were isolated from field-grown maize 
kernels. The fungi were found to be capable of growing on fumonisin B1 or B2 (FB1 or
FB2) as a sole carbon source, degrading it partially or completely in the process. One
species, identified as Exophiala spinifera, a "black yeast", was recovered from maize seed 
from diverse locations in the southeastern and south central US. The enzyme-active strain 
of Exophiala spinifera (ATCC 74269) was deposited (see US patent no. 5,716,820, issued 
February 10, 1998, US patent no. 5,792,931, issued August 11, 1998; US patent no. 
6,025,188, issued February 15, 2000; and pending US application no. 08/888,950, filed 
July 7, 1997). Other enzyme-active strains of Exophiala spinifera were used to isolate 
APAO polynucleotides. Isolate ESP002 was isolated from palm trees (ATCC 26089) and 
isolate ESP003 was isolated from maize seed. Another fungus from which APAO 
polynucleotides were isolated was Rhinocladiella atrovirens (RAT011).

Nucleic Acids

The present invention provides, inter alia, isolated nucleic acids of RNA, DNA, 
and analogs and/or chimeras thereof, comprising an APAO or trAPAO polynucleotide.

The present invention also includes polynucleotides optimized for expression in
different organisms. For example, for expression of the polynucleotide in a maize plant,
the sequence can be altered to account for specific codon preferences and to alter GC
content according to Murray et al., supra. Maize codon usage for 28 genes from maize
plants is listed in Table 4 of Murray, et al., supra.

The APAO or trAPAO nucleic acids of the present invention comprise isolated
APAO or trAPAO polynucleotides which are inclusive of:

(a) a polynucleotide encoding an APAO or trAPAO polypeptide of the sequences 
shown in SEQ ID NOS: 36, 38, 40, 42, 44, and 46, and conservatively modified and 
polymorphic variants thereof; 

(b) a polynucleotide which selectively hybridizes to a polynucleotide of (a) or (b);

(c) a polynucleotide having at least 50% sequence identity with polynucleotides of
(a) or (b);

(d) complementary sequences of polynucleotides of (a), (b), or (c); and

(e) a polynucleotide comprising at least 25 contiguous nucleotides from a
polynucleotide of (a), (b), (c), or (d).
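
As a purely illustrative sketch of how the contiguity criterion of item (e) might be checked computationally, the following function tests whether a candidate sequence contains a run of at least 25 contiguous nucleotides that also occurs in a reference sequence. The function name and the example sequences are hypothetical, and the sketch examines only the strand as written; complementary sequences would need to be tested separately.

```python
def shares_contiguous_run(candidate, reference, min_length=25):
    """Return True if candidate contains at least min_length contiguous
    nucleotides that also occur contiguously in reference."""
    candidate = candidate.upper()
    reference = reference.upper()
    if len(candidate) < min_length or len(reference) < min_length:
        return False
    # Index every window of min_length nucleotides in the reference.
    reference_windows = {reference[i:i + min_length]
                         for i in range(len(reference) - min_length + 1)}
    # Any shared run of min_length or more contains a shared window.
    return any(candidate[i:i + min_length] in reference_windows
               for i in range(len(candidate) - min_length + 1))


# Hypothetical sequences for illustration only.
reference = "ACGT" * 10
candidate = "TTT" + reference[5:30] + "GGG"
print(shares_contiguous_run(candidate, reference))
```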

In addition, polynucleotides are presented that are a fusion of an APAO or trAPAO 
polynucleotide and the polynucleotide of a fumonisin esterase. The invention 
encompasses the sequences from Exophiala or Rhinocladiella as well as sequences having 
sequence similarity with such sequences. It is recognized that the sequences of the
invention can be used to isolate corresponding sequences in other organisms. Methods
such as PCR, hybridization, and the like can be used to identify sequences having
substantial sequence similarity to the sequences of the invention. See, for example,
Sambrook, et al., (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring
Harbor Laboratory Press, Plainview, New York) and Innis et al., (1990) PCR Protocols: A
Guide to Methods and Applications (Academic Press, New York). Coding sequences
isolated based on their sequence identity to the entire fumonisin degrading coding
sequences set forth herein or to fragments thereof are encompassed by the present
invention.

It is recognized that the sequences of the invention can be used to isolate similar 
sequences from other fumonisin degrading organisms. Likewise sequences from other
fumonisin degrading organisms may be used in combination with the sequences of the
present invention. See, for example, copending application entitled "Compositions and 
Methods for Fumonisin Detoxification", U.S. application serial number 60/092,953, filed 
concurrently herewith and herein incorporated by reference. 

Plasmids containing the polynucleotide sequences of the invention were deposited
with American Type Culture Collection (ATCC), Manassas, Virginia, and assigned
Accession Nos. 98812, 98813, 98814, 98815, 98816, and PTA-32. These deposits will be
maintained under the terms of the Budapest Treaty on the International Recognition of the
Deposit of Microorganisms for the Purposes of Patent Procedure. These deposits were
made merely as a convenience for those of skill in the art and are not an admission that a
deposit is required under 35 U.S.C. § 112.

Construction of Nucleic Acids 

The isolated nucleic acids of the present invention can be made using (a) standard
recombinant methods, (b) synthetic techniques, or combinations thereof. In some
embodiments, the polynucleotides of the present invention will be cloned, amplified, or
otherwise constructed from a fungus or bacterium.

The nucleic acids may conveniently comprise sequences in addition to a
polynucleotide of the present invention. For example, a multi-cloning site comprising one
or more endonuclease restriction sites may be inserted into the nucleic acid to aid in
isolation of the polynucleotide. Also, translatable sequences may be inserted to aid in the
isolation of the translated polynucleotide of the present invention. For example, a
hexa-histidine marker sequence provides a convenient means to purify the proteins of the
present invention. The nucleic acid of the present invention, excluding the polynucleotide
sequence, is optionally a vector, adapter, or linker for cloning and/or expression of a
polynucleotide of the present invention. Additional sequences may be added to such
cloning and/or expression sequences to optimize their function in cloning and/or
expression, to aid in isolation of the polynucleotide, or to improve the introduction of the
polynucleotide into a cell. Typically, the length of a nucleic acid of the present invention
less the length of its polynucleotide of the present invention is less than 20 kilobase pairs,
often less than 15 kb, and frequently less than 10 kb. Use of cloning vectors, expression
vectors, adapters, and linkers is well known in the art. Exemplary nucleic acids include
such vectors as: M13, lambda ZAP Express, lambda ZAP II, lambda gt10, lambda gt11,
pBK-CMV, pBK-RSV, pBluescript II, lambda DASH II, lambda EMBL 3, lambda EMBL
4, pWE15, SuperCos 1, SurfZap, Uni-ZAP, pBC, pBS+A, pSG5, pBK, pCR-Script, pET,
pSPUTK, p3'SS, pGEM, pSK+/-, pGEX, pSPORTI and II, pOPRSVI CAT, pOP13 CAT,
pXT1, pSG5, pPbac, pMbac, pMC1neo, pOG44, pOG45, pFRTβGAL, pNEOβGAL,
pRS403, pRS404, pRS405, pRS406, pRS413, pRS414, pRS415, pRS416, lambda
MOSSlox, and lambda MOSElox. Optional vectors for the present invention include, but
are not limited to, lambda ZAP II and pGEX. For a description of various nucleic acids
see, for example, Stratagene Cloning Systems, Catalogs 1995, 1996, 1997 (La Jolla, CA);
and Amersham Life Sciences, Inc., Catalog '97 (Arlington Heights, IL).


Synthetic Methods for Constructing Nucleic Acids 

The isolated nucleic acids of the present invention can also be prepared by direct
chemical synthesis by methods such as the phosphotriester method of Narang et al., Meth.
Enzymol. 68: 90-99 (1979); the phosphodiester method of Brown et al., Meth. Enzymol.
68: 109-151 (1979); the diethylphosphoramidite method of Beaucage et al., Tetra. Lett. 22:
1859-1862 (1981); the solid phase phosphoramidite triester method described by Beaucage
and Caruthers, Tetra. Letts. 22(20): 1859-1862 (1981), e.g., using an automated
synthesizer, e.g., as described in Needham-VanDevanter et al., Nucleic Acids Res. 12:
6159-6168 (1984); and the solid support method of US Patent No. 4,458,066. Chemical
synthesis generally produces a single stranded oligonucleotide. This may be converted
into double stranded DNA by hybridization with a complementary sequence, or by
polymerization with a DNA polymerase using the single strand as a template. One of skill
will recognize that while chemical synthesis of DNA is limited to sequences of about 100
bases, longer sequences may be obtained by the ligation of shorter sequences.
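
By way of illustration only, the relationship between a synthesized single strand, its complementary strand, and a longer sequence assembled from shorter pieces can be sketched in Python. The function names, the 100-base limit used as a default, and the example sequence are assumptions made for this sketch and are not part of the disclosure.

```python
# Illustrative sketch only: helper names and the example sequence are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(oligo: str) -> str:
    """Return the strand (written 5'->3') that would hybridize to `oligo`."""
    return oligo.upper().translate(COMPLEMENT)[::-1]


def split_for_synthesis(sequence: str, max_len: int = 100) -> list[str]:
    """Split a long target into pieces no longer than `max_len` bases.

    The default of 100 reflects the approximate synthesis limit stated above.
    """
    return [sequence[i:i + max_len] for i in range(0, len(sequence), max_len)]


def ligate(fragments: list[str]) -> str:
    """Join shorter fragments end to end, modelling ligation into one sequence."""
    return "".join(fragment.upper() for fragment in fragments)


if __name__ == "__main__":
    target = "ATGGCC" * 50                    # 300-base hypothetical target
    pieces = split_for_synthesis(target)      # three pieces of 100 bases each
    assert ligate(pieces) == target           # ligation restores the full target
    print(reverse_complement("ATGGCC"))       # GGCCAT
```

The sketch models only sequence bookkeeping; it does not model overlap design, annealing conditions, or enzyme behavior.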

UTRs and Codon Preference

In general, translational efficiency has been found to be regulated by specific
sequence elements in the 5' non-coding or untranslated region (5' UTR) of the RNA.
Positive sequence motifs include translational initiation consensus sequences (Kozak,
Nucleic Acids Res. 15:8125 (1987)) and the 5' 7-methyl GpppG RNA cap structure
(Drummond et al., Nucleic Acids Res. 13:7375 (1985)). Negative elements include stable
intramolecular 5' UTR stem-loop structures (Muesing et al., Cell 48:691 (1987)) and AUG
sequences or short open reading frames preceded by an appropriate AUG in the 5' UTR
(Kozak, supra; Rao et al., Mol. and Cell. Biol. 8:284 (1988)). Accordingly, the present
invention provides 5' and/or 3' UTR regions for modulation of translation of heterologous
coding sequences.
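
As a non-limiting illustration, a 5' UTR can be screened for two of the negative elements noted above, namely upstream AUG sequences and short open reading frames that begin with an upstream AUG. The Python sketch below is offered under stated assumptions: the function names and the example sequence are hypothetical, and stem-loop stability and initiation-context scoring are not modelled.

```python
# Illustrative sketch only: names and the example UTR are hypothetical.
STOP_CODONS = {"UAA", "UAG", "UGA"}


def _as_rna(sequence: str) -> str:
    """Normalise a DNA or RNA string to upper-case RNA letters."""
    return sequence.upper().replace("T", "U")


def upstream_augs(utr: str) -> list[int]:
    """Return the 0-based position of every AUG found in the 5' UTR."""
    rna = _as_rna(utr)
    return [i for i in range(len(rna) - 2) if rna[i:i + 3] == "AUG"]


def upstream_orfs(utr: str) -> list[tuple[int, int]]:
    """Return (start, end) spans of AUG-initiated frames that reach an
    in-frame stop codon inside the UTR (end is exclusive)."""
    rna = _as_rna(utr)
    spans = []
    for start in upstream_augs(rna):
        for pos in range(start + 3, len(rna) - 2, 3):
            if rna[pos:pos + 3] in STOP_CODONS:
                spans.append((start, pos + 3))
                break
    return spans


if __name__ == "__main__":
    utr = "GGATGAAATAAGG"          # hypothetical UTR with one short upstream ORF
    print(upstream_augs(utr))      # [2]
    print(upstream_orfs(utr))      # [(2, 11)]
```

A UTR for which both functions return empty lists lacks these particular negative elements; it may still contain others, such as stable secondary structure.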

Further, the polypeptide-encoding segments of the polynucleotides of the present
invention can be modified to alter codon usage. Altered codon usage can be employed to
alter translational efficiency and/or to optimize the coding sequence for expression in a
desired host or to optimize the codon usage in a heterologous sequence for expression in
maize. Codon usage in the coding regions of the polynucleotides of the present invention
can be analyzed statistically using commercially available software packages such as
"Codon Preference" available from the University of Wisconsin Genetics Computer Group
(see Devereaux et al., Nucleic Acids Res. 12: 387-395 (1984)) or MacVector 4.1 (Eastman
Kodak Co., New Haven, Conn.). Thus, the present invention provides a codon usage
frequency characteristic of the coding region of at least one of the polynucleotides of the
present invention. The number of polynucleotides (3 nucleotides per amino acid) that can
be used to determine a codon usage frequency can be any integer from 3 to the number of
polynucleotides of the present invention as provided herein. Optionally, the
polynucleotides will be full-length sequences. An exemplary number of sequences for
statistical analysis can be at least 1, 5, 10, 20, 50, or 100.
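
For illustration, a codon usage frequency of the kind described above can be tabulated from one or more in-frame coding sequences. The following Python sketch is not the "Codon Preference" program or MacVector and makes simplifying assumptions: each input is taken to begin in frame, trailing partial codons are ignored, and the function name and example sequences are hypothetical.

```python
# Illustrative sketch only: not the GCG or MacVector implementation.
from collections import Counter


def codon_usage(coding_sequences: list[str]) -> dict[str, float]:
    """Return the fraction of all counted codons represented by each codon.

    Each sequence is assumed to start in frame; a trailing partial codon
    (one or two leftover bases) is ignored.
    """
    counts: Counter[str] = Counter()
    for sequence in coding_sequences:
        sequence = sequence.upper()
        usable = len(sequence) - len(sequence) % 3
        counts.update(sequence[i:i + 3] for i in range(0, usable, 3))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {codon: n / total for codon, n in counts.items()}


if __name__ == "__main__":
    # Two short hypothetical coding fragments: ATG GCC GCC and ATG GCT.
    usage = codon_usage(["ATGGCCGCC", "ATGGCT"])
    for codon, fraction in sorted(usage.items()):
        print(codon, round(fraction, 2))
```

A table produced this way for a set of sequences can be compared with a table for the intended host, such as maize, when deciding which codons to alter.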

Sequence Shuffling 

The present invention provides methods for sequence shuffling using
polynucleotides of the present invention, and compositions resulting therefrom. Sequence
shuffling is described in PCT publication No. 96/19256. See also, Zhang, J.-H., et al.,
Proc. Natl. Acad. Sci. USA 94:4504-4509 (1997) and Zhao, et al., Nature Biotech. 16:258-
261 (1998). Generally, sequence shuffling provides a means for generating libraries of
polynucleotides having a desired characteristic, which can be selected or screened for.
Libraries of recombinant polynucleotides are generated from a population of related
sequence polynucleotides, which comprise sequence regions which have substantial
sequence identity and can be homologously recombined in vitro or in vivo. The
population of sequence-recombined polynucleotides comprises a subpopulation of
polynucleotides which possess desired or advantageous characteristics and which can be
selected by a suitable selection or screening method. The characteristics can be any
property or attribute capable of being selected for or detected in a screening system, and
may include properties of: an encoded protein, a transcriptional element, a sequence
controlling transcription, RNA processing, RNA stability, chromatin conformation,
translation, or other expression property of a gene or transgene, a replicative element, a
protein-binding element, or the like, such as any feature which confers a selectable or
detectable property. In some embodiments, the selected characteristic will be an altered
and/or over the wild-type protein as provided herein. In other embodiments, a
protein or polynucleotide generated from sequence shuffling will have a substrate binding
affinity greater than the non-shuffled wild-type polynucleotide. In yet other embodiments,
a protein or polynucleotide generated from sequence shuffling will have an altered pH
optimum as compared to the non-shuffled wild-type polynucleotide. The increase in such
properties can be at least 110%, 120%, 130%, 140% or greater than 150% of the wild-type
value.

Recombinant Expression Cassettes 

The present invention further provides recombinant expression cassettes
comprising a nucleic acid of the present invention. A nucleic acid sequence coding for the
desired polynucleotide of the present invention, for example a cDNA or a genomic
sequence encoding a polypeptide long enough to code for an active protein of the present
invention, can be used to construct a recombinant expression cassette which can be
introduced into the desired host cell. A recombinant expression cassette will typically
comprise a polynucleotide of the present invention operably linked to transcriptional
initiation regulatory sequences which will direct the transcription of the polynucleotide in
the intended host cell, such as tissues of a transformed plant.

For example, plant expression vectors may include (1) a cloned plant gene under
the transcriptional control of 5' and 3' regulatory sequences and (2) a dominant selectable
marker. Such plant expression vectors may also contain, if desired, a promoter regulatory
region (e.g., one conferring inducible or constitutive, environmentally- or
developmentally-regulated, or cell- or tissue-specific/selective expression), a transcription
initiation start site, a ribosome binding site, an RNA processing signal, a transcription
termination site, and/or a polyadenylation signal.

A plant promoter fragment can be employed which will direct expression of a
polynucleotide of the present invention in all tissues of a regenerated plant. Such
promoters are referred to herein as "constitutive" promoters and are active under most
environmental conditions and states of development or cell differentiation. Examples of
constitutive promoters include the 1'- or 2'- promoter derived from T-DNA of
Agrobacterium tumefaciens, the Smas promoter, the cinnamyl alcohol dehydrogenase
promoter (US Patent No. 5,683,439), the Nos promoter, the rubisco promoter, the GRP1-8
promoter, the 35S promoter from cauliflower mosaic virus (CaMV), as described in Odell
et al., (1985), Nature, 313:810-812; rice actin (McElroy et al., (1990), Plant Cell 2:163-
171); ubiquitin (Christensen et al., (1992), Plant Mol. Biol. 12:619-632; and Christensen,
et al., (1992), Plant Mol. Biol. 18:675-689); pEMU (Last, et al., (1991), Theor. Appl.
Genet. 81:581-588); MAS (Velten et al., (1984), EMBO J. 3:2723-2730); and maize H3
histone (Lepetit et al., (1992), Mol. Gen. Genet. 231:276-285; and Atanassova et al.,
(1992), Plant Journal 2(3):291-300), the Rsyn7 as described in published PCT Application
WO 97/44756, ALS promoter, as described in published PCT Application WO 96/30530,
and other transcription initiation regions from various plant genes known to those of skill.
For the present invention ubiquitin is the preferred promoter for expression in monocot
plants.

Alternatively, the plant promoter can direct expression of a polynucleotide of the
present invention in a specific tissue or may be otherwise under more precise
environmental or developmental control. Such promoters are referred to here as
"inducible" promoters. Environmental conditions that may affect transcription by
inducible promoters include pathogen attack, anaerobic conditions, or the presence of light.
Examples of inducible promoters are the Adh1 promoter, which is inducible by hypoxia or
cold stress, the Hsp70 promoter, which is inducible by heat stress, and the PPDK promoter,
which is inducible by light.

Examples of promoters under developmental control include promoters that initiate
transcription only, or preferentially, in certain tissues, such as leaves, roots, fruit, seeds, or
flowers. The operation of a promoter may also vary depending on its location in the
genome. Thus, an inducible promoter may become fully or partially constitutive in certain
locations.

If polypeptide expression is desired, it is generally desirable to include a
polyadenylation region at the 3'-end of a polynucleotide coding region. The
polyadenylation region can be derived from a variety of plant genes, or from T-DNA. The
3' end sequence to be added can be derived from, for example, the nopaline synthase or
octopine synthase genes, or alternatively from another plant gene, or less preferably from
any other eukaryotic gene. Examples of such regulatory elements include, but are not
limited to, 3' termination and/or polyadenylation regions such as those of the
Agrobacterium tumefaciens nopaline synthase (nos) gene (Bevan et al., (1983), Nucl. Acids
Res. 12:369-385); the potato proteinase inhibitor II (PINII) gene (Keil, et al., (1986), Nucl.
Acids Res. 14:5641-5650; and An et al., (1989), Plant Cell 1:115-122); and the CaMV 19S
gene (Mogen et al., (1990), Plant Cell 2:1261-1272).

An intron sequence can be added to the 5' untranslated region or the coding
sequence of the partial coding sequence to increase the amount of the mature message that
accumulates in the cytosol. Inclusion of a spliceable intron in the transcription unit in both
plant and animal expression constructs has been shown to increase gene expression at both
the mRNA and protein levels up to 1000-fold. Buchman and Berg, Mol. Cell. Biol. 8: 4395-
4405 (1988); Callis et al., Genes Dev. 1: 1183-1200 (1987). Such intron enhancement of
gene expression is typically greatest when the intron is placed near the 5' end of the
transcription unit. Use of the maize introns Adh1-S intron 1, 2, and 6, and the Bronze-1
intron is known in the art. See generally, The Maize Handbook, Chapter 116, Freeling and
Walbot, Eds., Springer, New York (1994).

Plant signal sequences, including, but not limited to, signal-peptide encoding
DNA/RNA sequences which target proteins to the extracellular matrix of the plant cell
(Dratewka-Kos, et al., (1989), J. Biol. Chem. 264:4896-4900), the Nicotiana
plumbaginifolia extensin gene (DeLoose, et al., (1991), Gene 99:95-100), signal peptides
which target proteins to the vacuole like the sweet potato sporamin gene (Matsuka, et al.,
(1991), PNAS 88:834) and the barley lectin gene (Wilkins, et al., (1990), Plant Cell 2:301-
313), signal peptides which cause proteins to be secreted such as that of PR1b (Lind, et al.,
(1992), Plant Mol. Biol. 18:47-53), or the barley alpha amylase (BAA) (Rahmatullah, et
al., Plant Mol. Biol. 12:119 (1989), hereby incorporated by reference), or from the
present invention the signal peptide from the ESP1 or BEST1 gene, or signal peptides
which target proteins to the plastids such as that of rapeseed enoyl-ACP reductase
(Verwaert, et al., (1994), Plant Mol. Biol. 26:189-202) are useful in the invention. The
barley alpha amylase signal sequence operably linked to the trAPAO or APAO
polynucleotide is the preferred construct for expression in maize for the present invention.

The vector comprising the sequences from a polynucleotide of the present
invention will typically comprise a marker gene, which confers a selectable phenotype on
plant cells. Usually, the selectable marker gene will encode antibiotic resistance, with
suitable genes including genes coding for resistance to the antibiotic spectinomycin (e.g.,
the aadA gene), the streptomycin phosphotransferase (SPT) gene coding for streptomycin
resistance, the neomycin phosphotransferase (NPTII) gene encoding kanamycin or
geneticin resistance, the hygromycin phosphotransferase (HPT) gene coding for
hygromycin resistance, genes coding for resistance to herbicides which act to inhibit the
action of acetolactate synthase (ALS), in particular the sulfonylurea-type herbicides (e.g.,
the acetolactate synthase (ALS) gene containing mutations leading to such resistance, in
particular the S4 and/or Hra mutations), genes coding for resistance to herbicides which act
to inhibit action of glutamine synthetase, such as phosphinothricin or basta (e.g., the bar
gene), or other such genes known in the art. The bar gene encodes resistance to the
herbicide basta, and the ALS gene encodes resistance to the herbicide chlorsulfuron.

Alternatively, the invention itself could be used as a method for selection of
transformants, in other words as a selectable marker. An APAO or trAPAO
polynucleotide operably linked to a promoter and then transformed into a plant cell by any
of the methods described in the present application would express the degradative enzyme.
When the plant cells are placed in the presence of fumonisin, AP1, or a phytotoxic analog
in culture, only the transformed cells would be able to grow. In another embodiment, the
plant cell could be transformed with both a polynucleotide for APAO and a polynucleotide
for fumonisin esterase. The selective agent in this case could be either AP1 or fumonisin
or any structural analog. Thus, growth of plant cells in the presence of a mycotoxin favors
the survival of plant cells that have been transformed to express the coding sequence that
codes for one of the enzymes of this invention and degrades the toxin. When the APAO or
trAPAO cassette, with or without the fumonisin esterase polynucleotide, is co-transformed
with another gene of interest and then placed in the presence of fumonisin, AP1 or a
phytotoxic analog, this invention would allow for selection of only those plant cells that
contain the gene of interest. In the past, antibiotic resistance genes have been used as
selectable markers. Given the current concerns of consumers and environmentalists over
use of antibiotic genes and the possibility of resistant microorganisms arising due to this
use, a non-antibiotic resistant selectable marker system such as the present invention
fulfills this very important need.

Typical vectors useful for expression of genes in higher plants are well known in
the art and include vectors derived from the tumor-inducing (Ti) plasmid of Agrobacterium
tumefaciens described by Rogers et al., Meth. in Enzymol., 153:253-277 (1987). These
vectors are plant integrating vectors in that on transformation, the vectors integrate a
portion of vector DNA into the genome of the host plant. Exemplary A. tumefaciens
vectors useful herein are plasmids pKYLX6 and pKYLX7 of Schardl et al., Gene, 61:1-11
(1987) and Berger et al., Proc. Natl. Acad. Sci. U.S.A., 86:8402-8406 (1989). Another
useful vector herein is plasmid pBI101.2 that is available from CLONTECH Laboratories,
Inc. (Palo Alto, CA).

Expression of Proteins in Host Cells 

Using the nucleic acids of the present invention, one may express a protein of the
present invention in a recombinantly engineered cell such as bacteria, yeast, insect,
mammalian, or preferably plant cells. The cells produce the protein in a non-natural
condition (e.g., in quantity, composition, location, and/or time), because they have been
genetically altered through human intervention to do so.

It is expected that those of skill in the art are knowledgeable in the numerous
expression systems available for expression of a nucleic acid encoding a protein of the
present invention. No attempt to describe in detail the various methods known for the
expression of proteins in prokaryotes or eukaryotes will be made.

In brief summary, the expression of isolated nucleic acids encoding a protein of the
present invention will typically be achieved by operably linking, for example, the DNA or
cDNA to a promoter (which is either constitutive or inducible), followed by incorporation
into an expression vector. The vectors can be suitable for replication and integration in
either prokaryotes or eukaryotes. Typical expression vectors contain transcription and
translation terminators, initiation sequences, and promoters useful for regulation of the
expression of the DNA encoding a protein of the present invention. To obtain high level
expression of a cloned gene, it is desirable to construct expression vectors which contain,
at the minimum, a strong promoter, such as ubiquitin, to direct transcription, a ribosome
binding site for translational initiation, and a transcription/translation terminator.
Constitutive promoters are classified as providing for a range of constitutive expression.
Thus, some are weak constitutive promoters, and others are strong constitutive promoters.
Generally, by "weak promoter" is intended a promoter that drives expression of a coding
sequence at a low level. By "low level" is intended at levels of about 1/10,000 transcripts
to about 1/100,000 transcripts to about 1/500,000 transcripts. Conversely, a "strong
promoter" drives expression of a coding sequence at a "high level", or about 1/10
transcripts to about 1/100 transcripts to about 1/1,000 transcripts.
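
The transcript ranges given above for "weak" and "strong" promoters can be expressed, purely as an illustrative sketch, as a simple classification of a measured transcript fraction. The Python function below is hypothetical. Treating 1/1,000 and 1/10,000 as hard cutoffs, and labelling values between them "intermediate", are assumptions of the sketch, since the text gives only approximate ranges.

```python
# Illustrative sketch only: cutoffs are assumptions drawn from the
# approximate ranges stated in the text, not defined thresholds.
def classify_promoter(target_transcripts: int, total_transcripts: int) -> str:
    """Classify promoter strength from the share of total transcripts.

    strong:       about 1/10 to about 1/1,000 of transcripts
    weak:         about 1/10,000 to about 1/500,000 of transcripts
    intermediate: anything between the two ranges (assumed label)
    """
    if total_transcripts <= 0 or target_transcripts < 0:
        raise ValueError("transcript counts must be non-negative, total > 0")
    # Integer comparisons avoid floating-point rounding at the cutoffs.
    if target_transcripts * 1_000 >= total_transcripts:
        return "strong"
    if target_transcripts * 10_000 <= total_transcripts:
        return "weak"
    return "intermediate"


if __name__ == "__main__":
    print(classify_promoter(1, 100))        # strong
    print(classify_promoter(1, 100_000))    # weak
    print(classify_promoter(1, 5_000))      # intermediate
```

In practice the ranges are described with the word "about", so values near a cutoff would call for judgment rather than a strict comparison.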

One of skill would recognize that modifications could be made to a protein of the
present invention without diminishing its biological activity. Some modifications may be
made to facilitate the cloning, expression, or incorporation of the targeting molecule into a
fusion protein. Such modifications are well known to those of skill in the art and include,
for example, a methionine added at the amino terminus to provide an initiation site, or
additional amino acids (e.g., poly His) placed on either terminus to create conveniently
located restriction sites or termination codons or purification sequences.


A. Expression in Prokaryotes 

Prokaryotic cells may be used as hosts for expression. Prokaryotes most frequently
are represented by various strains of E. coli; however, other microbial strains may also be
used. Commonly used prokaryotic control sequences, which are defined herein to include
promoters for transcription initiation, optionally with an operator, along with ribosome
binding site sequences, include such commonly used promoters as the beta lactamase
(penicillinase) and lactose (lac) promoter systems (Chang et al., Nature 198:1056 (1977)),
the tryptophan (trp) promoter system (Goeddel et al., Nucleic Acids Res. 8:4057 (1980))
and the lambda derived PL promoter and N-gene ribosome binding site (Shimatake et al.,
Nature 292:128 (1981)). The inclusion of selection markers in DNA vectors transfected in
E. coli is also useful. Examples of such markers include genes specifying resistance to
ampicillin, tetracycline, or chloramphenicol.

The vector is selected to allow introduction of the gene of interest into the
appropriate host cell. Bacterial vectors are typically of plasmid or phage origin.
Appropriate bacterial cells are infected with phage vector particles or transfected with
naked phage vector DNA. If a plasmid vector is used, the bacterial cells are transfected
with the plasmid vector DNA. Expression systems for expressing a protein of the present
invention are available using Bacillus sp. and Salmonella (Palva, et al., Gene 22: 229-235
(1983); Mosbach, et al., Nature 302: 543-545 (1983)). The pGEX-4T-1 plasmid vector
from Pharmacia is the preferred E. coli expression vector for the present invention.

B. Expression in Eukaryotes

A variety of eukaryotic expression systems such as yeast, insect cell lines, plant and
mammalian cells, are known to those of skill in the art. As explained briefly below, the
present invention can be expressed in these eukaryotic systems. In some embodiments,
transformed/transfected plant cells, as discussed infra, are employed as expression systems
for production of the proteins of the instant invention.

Synthesis of heterologous proteins in yeast is well known. Sherman, F., et al.,
Methods in Yeast Genetics, Cold Spring Harbor Laboratory (1982) is a well recognized
work describing the various methods available to produce the protein in yeast. Two widely
utilized yeasts for production of eukaryotic proteins are Saccharomyces cerevisiae and
Pichia pastoris. Vectors, strains, and protocols for expression in Saccharomyces and
Pichia are known in the art and available from commercial suppliers (e.g., Invitrogen).
Suitable vectors usually have expression control sequences, such as promoters, including
3-phosphoglycerate kinase or alcohol oxidase, and an origin of replication, termination
sequences and the like as desired.

A protein of the present invention, once expressed, can be isolated from yeast by
lysing the cells and applying standard protein isolation techniques to the lysates or the
pellets. The monitoring of the purification process can be accomplished by using Western
blot techniques or radioimmunoassay or other standard immunoassay techniques.

The sequences encoding proteins of the present invention can also be ligated to
various expression vectors for use in transfecting cell cultures of, for instance, mammalian,
insect, or plant origin. Mammalian cell systems often will be in the form of monolayers of
cells although mammalian cell suspensions may also be used. A number of suitable host
cell lines capable of expressing intact proteins have been developed in the art, and include
the HEK293, BHK21, and CHO cell lines. Expression vectors for these cells can include
expression control sequences, such as an origin of replication, a promoter (e.g., the CMV
promoter, a HSV tk promoter or pgk (phosphoglycerate kinase) promoter), an enhancer
(Queen et al., Immunol. Rev. 89: 49 (1986)), and necessary processing information sites,
such as ribosome binding sites, RNA splice sites, polyadenylation sites (e.g., an SV40
large T Ag poly A addition site), and transcriptional terminator sequences. Other animal
cells useful for production of proteins of the present invention are available, for instance,
from the American Type Culture Collection Catalogue of Cell Lines and Hybridomas (7th
edition, 1992).

Appropriate vectors for expressing proteins of the present invention in insect cells
are usually derived from the SF9 baculovirus. Suitable insect cell lines include mosquito
larvae, silkworm, armyworm, moth, and Drosophila cell lines such as a Schneider cell line
(see Schneider, J. Embryol. Exp. Morphol. 27: 353-365 (1987)).

As with yeast, when higher animal or plant host cells are employed,
polyadenylation or transcription terminator sequences are typically incorporated into the
vector. An example of a terminator sequence is the polyadenylation sequence from the
bovine growth hormone gene. Sequences for accurate splicing of the transcript may also
be included. An example of a splicing sequence is the VP1 intron from SV40 (Sprague, et
al., J. Virol. 45: 773-781 (1983)). Additionally, gene sequences to control replication in
the host cell may be incorporated into the vector such as those found in bovine papilloma
virus type-vectors. Saveria-Campo, M., "Bovine Papilloma Virus DNA: a Eukaryotic
Cloning Vector," in DNA Cloning Vol. II: a Practical Approach, D.M. Glover, Ed., IRL Press,
Arlington, Virginia, pp. 213-238 (1985).

In addition, one of the genes for fumonisin esterase or the APAO or trAPAO placed
in the appropriate plant expression vector can be used to transform plant cells. The
enzyme can then be isolated from plant callus or the transformed cells can be used to
regenerate transgenic plants. Such transgenic plants can be harvested, and the appropriate
tissues (seed or leaves, for example) can be subjected to large scale protein extraction and
purification techniques, and the fumonisin degradation enzymes or APAO can be isolated
for use in fumonisin and fumonisin hydrolysis product detoxification processes.

Plant Transformation Methods 

Numerous methods for introducing foreign genes into plants are known and can be
used to insert an APAO or trAPAO polynucleotide into a plant host, including biological
and physical plant transformation protocols. See, for example, Miki et al., (1993),
"Procedure for Introducing Foreign DNA into Plants", In: Methods in Plant Molecular
Biology and Biotechnology, Glick and Thompson, eds., CRC Press, Inc., Boca Raton,
pages 67-88. The methods chosen vary with the host plant, and include chemical
transfection methods such as calcium phosphate, microorganism-mediated gene transfer
such as Agrobacterium (Horsch, et al., (1985), Science 227:1229-31), electroporation,
micro-injection, and biolistic bombardment.

Expression cassettes and vectors and in vitro culture methods for plant cell or tissue
transformation and regeneration of plants are known and available. See, for example,
Gruber, et al., (1993), "Vectors for Plant Transformation" In: Methods in Plant Molecular
Biology and Biotechnology, Glick and Thompson, eds., CRC Press, Inc., Boca Raton, pages
89-119.

Agrobacterium-mediated Transformation

The most widely utilized method for introducing an expression vector into plants is
based on the natural transformation system of Agrobacterium. A. tumefaciens and A.
rhizogenes are plant pathogenic soil bacteria, which genetically transform plant cells. The
Ti and Ri plasmids of A. tumefaciens and A. rhizogenes, respectively, carry genes
responsible for genetic transformation of plants. See, for example, Kado, (1991), Crit.
Rev. Plant Sci. 10:1. Descriptions of the Agrobacterium vector systems and methods for
Agrobacterium-mediated gene transfer are provided in Gruber et al., supra; Miki, et al.,
supra; and Moloney et al., (1989), Plant Cell Reports 8:238.

Similarly, the gene can be inserted into the T-DNA region of a Ti or Ri plasmid
derived from A. tumefaciens or A. rhizogenes, respectively. Thus, expression cassettes can
be constructed as above, using these plasmids. Many control sequences are known which
when coupled to a heterologous coding sequence and transformed into a host organism
show fidelity in gene expression with respect to tissue/organ specificity of the original
coding sequence. See, e.g., Benfey, P. N., and Chua, N. H. (1989) Science 244: 174-181.
Particularly suitable control sequences for use in these plasmids are promoters for
constitutive leaf-specific expression of the gene in the various target plants. Other useful
control sequences include a promoter and terminator from the nopaline synthase gene
(NOS). The NOS promoter and terminator are present in the plasmid pARC2, available
from the American Type Culture Collection and designated ATCC 67238. If such a
system is used, the virulence (vir) gene from either the Ti or Ri plasmid must also be
present, either along with the T-DNA portion, or via a binary system where the vir gene is
present on a separate vector. Such systems, vectors for use therein, and methods of
transforming plant cells are described in US Pat. No. 4,658,082; US application Ser. No.
913,914, filed Oct. 1, 1986, as referenced in US Patent 5,262,306, issued November 16,
1993 to Robeson, et al.; and Simpson, R. B., et al. (1986) Plant Mol. Biol. 6: 403-415 (also
referenced in the '306 patent); all incorporated by reference in their entirety.

Once constructed, these plasmids can be placed into A. rhizogenes or A.
tumefaciens and these vectors used to transform cells of plant species, which are ordinarily
susceptible to Fusarium or Alternaria infection. Several other transgenic plants are also
contemplated by the present invention including but not limited to soybean, corn, sorghum,
alfalfa, rice, clover, cabbage, banana, coffee, celery, tobacco, cowpea, cotton, melon and
pepper. The selection of either A. tumefaciens or A. rhizogenes will depend on the plant
being transformed thereby. In general A. tumefaciens is the preferred organism for
transformation. Most dicotyledonous plants, some gymnosperms, and a few
monocotyledonous plants (e.g. certain members of the Liliales and Arales) are susceptible
to infection with A. tumefaciens. A. rhizogenes also has a wide host range, embracing most
dicots and some gymnosperms, which includes members of the Leguminosae, Compositae,
and Chenopodiaceae. Monocot plants can now be transformed with some success.
European Patent Application Publication Number 604 662 A1 to Hiei et al. discloses a
method for transforming monocots using Agrobacterium. Saito et al. discloses a method
for transforming monocots with Agrobacterium using the scutellum of immature embryos
(European Application 672 752 A1). Ishida et al. discusses a method for transforming
maize by exposing immature embryos to A. tumefaciens (Ishida et al., Nature
Biotechnology, 1996, 14:745-750).

Once transformed, these cells can be used to regenerate transgenic plants capable
of degrading fumonisin. For example, whole plants can be infected with these vectors by
wounding the plant and then introducing the vector into the wound site. Any part of the
plant can be wounded, including leaves, stems and roots. Alternatively, plant tissue, in the
form of an explant, such as cotyledonary tissue or leaf disks, can be inoculated with these
vectors and cultured under conditions that promote plant regeneration. Roots or shoots
transformed by inoculation of plant tissue with A. rhizogenes or A. tumefaciens, containing
the gene coding for the fumonisin degradation enzyme, can be used as a source of plant
tissue to regenerate fumonisin-resistant transgenic plants, either via somatic embryogenesis
or organogenesis. Examples of such methods for regenerating plant tissue are disclosed in
Shahin, E. A. (1985) Theor. Appl. Genet. 69:235-240; US Pat. No. 4,658,082; Simpson, R.
B., et al. (1986) Plant Mol Biol 6: 403-415; and U.S. patent applications Ser. Nos.
913,913 and 913,914, both filed Oct. 1, 1986, as referenced in U.S. Patent 5,262,306,
issued November 16, 1993 to Robeson, et al.; the entire disclosures therein incorporated
herein by reference.


Direct Gene Transfer 

Despite the fact that the host range for Agrobacterium-mediated transformation is
broad, some major cereal crop species and gymnosperms have generally been recalcitrant
to this mode of gene transfer, even though some success has recently been achieved in rice
(Hiei et al., (1994), The Plant Journal 6:271-282). Several methods of plant
transformation, collectively referred to as direct gene transfer, have been developed as an
alternative to Agrobacterium-mediated transformation.

A generally applicable method of plant transformation is microprojectile-mediated
transformation, where DNA is carried on the surface of microprojectiles measuring about 1
to 4 µm. The expression vector is introduced into plant tissues with a biolistic device that
accelerates the microprojectiles to speeds of 300 to 600 m/s, which is sufficient to penetrate
the plant cell walls and membranes (Sanford et al., (1987), Part. Sci. Technol. 5:27;
Sanford, 1988, Trends Biotech 6:299; Sanford, (1990), Physiol Plant 79:206; Klein et al.,
(1992), Biotechnology 10:268).

Another method for physical delivery of DNA to plants is sonication of target cells
as described in Zang et al., (1991), BioTechnology 9:996. Alternatively, liposome or
spheroplast fusions have been used to introduce expression vectors into plants. See, for
example, Deshayes et al., (1985), EMBO J. 4:2731; and Christou et al., (1987), PNAS USA
84:3962. Direct uptake of DNA into protoplasts using CaCl2 precipitation, polyvinyl
alcohol, or poly-L-ornithine has also been reported. See, for example, Hain et al., (1985),
Mol. Gen. Genet. 199:161; and Draper et al., (1982), Plant Cell Physiol 23:451.

Electroporation of protoplasts and whole cells and tissues has also been described.
See, for example, Donn et al., (1990), In: Abstracts of the VIIth Int'l Congress on Plant
Cell and Tissue Culture IAPTC, A2-38, page 53; D'Halluin et al., (1992), Plant Cell
4:1495-1505; and Spencer et al., (1994), Plant Mol Biol 24:51-61.

Thus, a polynucleotide encoding a polypeptide able to degrade fumonisin or AP1 can
be isolated and cloned in an appropriate vector and inserted into an organism normally
sensitive to Fusarium or its toxins. Furthermore, the polynucleotide imparting
fumonisin or AP1 degradative activity can be transferred into a suitable plasmid and
transformed into a plant. Thus, a fumonisin or AP1 degrading transgenic plant can be
produced. Organisms expressing the polynucleotide can be easily identified by their
ability to degrade fumonisin or AP1. The protein capable of degrading fumonisin or AP1
can be isolated and characterized using techniques well known in the art.

APAO or trAPAO in a Transgenic Plant 

Fumonisin esterase reduces but does not eliminate the toxicity of fumonisins.
Therefore a second enzymatic modification to further reduce or abolish toxicity is
desirable. The partially purified APAO enzyme from Exophiala spinifera has little or no
activity on intact FB1, a form of fumonisin. However, recombinant APAO enzyme from
Exophiala spinifera, expressed in E. coli, has significant but reduced activity on intact FB1
and other B-series fumonisins. APAO or trAPAO thus could potentially be used without
fumonisin esterase since the amine group is the major target for detoxification.
Alternatively, the two genes, fumonisin esterase and APAO (or trAPAO), can be used
together for degrading toxins.

APAO is predicted to be an enzyme that, when expressed by itself or co-expressed in a
heterologous expression system along with fumonisin esterase (either ESP1 or BEST1),
will result in the production of 2-oxo-FB1 and/or 2-oxo pentol (2-OP) from fumonisin B1.
The substrate range of recombinant, E. coli-expressed APAO is limited to fumonisins and
their hydrolysis products and does not include amino acids, sphingolipid precursors such
as phytosphingosine, or polyamines such as spermidine. Thus, APAO is highly specific
for fumonisin-like amines and would have little deleterious effect on other cellular
metabolites. In addition, if it is extracellularly localized, it will limit any contact with
biologically important amines that might also be substrates. The end result will be a more
effective detoxification of fumonisins than can be achieved with esterase alone.

The oxidase activity of APAO is predicted to result in generation of hydrogen peroxide
in stoichiometric amounts relative to AP1 or fumonisin oxidized. This may prove to be an
additional benefit of this enzyme, since hydrogen peroxide is antimicrobial and is
thought to contribute to the onset of a defense response in plants (Przemylaw, Biochem J.,
322:681-692 (1997); Lamb, et al., Ann Rev Plant Physiol Plant Mol Bio 48:251-275
(1997); and Alverez, et al., Oxidative Stress and the Molecular Biology of Antioxidant
Defenses, Cold Spring Harbor Press, 815-839 (1997)).

Because one embodiment of the present invention is to have both a fumonisin
esterase polynucleotide and an APAO or trAPAO polynucleotide present in a plant, several
ways to introduce more than one polynucleotide into a plant are contemplated. One way is to
transform plant tissue with polynucleotides for both fumonisin esterase and APAO or
trAPAO at the same time. In some tissue culture systems it is possible to transform callus
with one polynucleotide and then, after establishing a stable culture line containing the first
polynucleotide, transform the callus a second time with the second polynucleotide. One
could also transform plant tissue with one polynucleotide, regenerate whole plants, then
transform the second polynucleotide into plant tissue and regenerate whole plants. The
final step would then be to cross a plant containing the first polynucleotide with a plant
containing the second polynucleotide and select for progeny containing both
polynucleotides.
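
The crossing strategy described above implies that only part of the progeny will carry both polynucleotides. The following Python sketch estimates that fraction under assumptions that are not stated in the text: each parent carries only its own transgene, inserted at a single locus, with ordinary Mendelian segregation. The function names are hypothetical and serve only as an illustration.

```python
from fractions import Fraction


def gamete_probability(copies):
    """Chance that a gamete carries the transgene, given 0, 1 or 2 copies at one locus."""
    if copies not in (0, 1, 2):
        raise ValueError("copies must be 0, 1 or 2")
    return Fraction(copies, 2)


def fraction_with_both(esterase_parent_copies, apao_parent_copies):
    """Expected fraction of progeny inheriting both transgenes from the cross.

    Assumption: one parent contributes only the esterase transgene and the
    other only the APAO (or trAPAO) transgene, so the two events are independent.
    """
    return (gamete_probability(esterase_parent_copies)
            * gamete_probability(apao_parent_copies))


# Two hemizygous (single-copy) parents, then one homozygous and one hemizygous parent.
print(fraction_with_both(1, 1))
print(fraction_with_both(2, 1))
```

Under these assumptions about one quarter of the progeny of two hemizygous parents would be expected to carry both polynucleotides, which gives a rough idea of how many progeny need to be screened.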

Another method is to create a fusion protein between esterase and APAO or trAPAO,
preferably with a spacer region between the two polypeptides. Both enzymes would be
active although tethered to each other. In addition, an enzyme cleavage site engineered in
the spacer region would allow cleavage by an endogenous or introduced protease.

Transgenic plants containing a fumonisin esterase enzyme and/or the APAO
enzyme and thus able to degrade fumonisin or a structurally related mycotoxin would be
able to reduce or eliminate the pathogenicity of any microorganism that uses fumonisin or
a structurally related mycotoxin as a mode of entry to infect a plant. Fungal pathogens
frequently use toxins to damage plants and weaken cell integrity in order to gain entry and
expand infection in a plant. By preventing the damage induced by a toxin, a plant would
be able to prevent the establishment of the pathogen and thereby become tolerant or
resistant to the pathogen.

Another benefit of fumonisin degradation is the production of hydrogen peroxide.
When fumonisin or AP1 is oxidatively deaminated at C-2, as occurs by exposure to
APAO or trAPAO enzyme, hydrogen peroxide is produced as a by-product. Hydrogen
peroxide production can trigger enhanced resistance responses in a number of ways. 1)
Hydrogen peroxide has direct antimicrobial activity. 2) Hydrogen peroxide acts as a
substrate for peroxidases associated with lignin polymerization and hence cell wall
strengthening. 3) Via still to be determined mechanisms, hydrogen peroxide acts as a
signal for activation of expression of defense related genes, including those that result in
stimulation of salicylic acid accumulation. Salicylic acid is thought to act as an endogenous
signal molecule that triggers expression of genes coding for several classes of
pathogenesis-related proteins. Moreover, salicylic acid may set up the oxidative burst and
thus act in a feedback loop enhancing its own synthesis. Salicylic acid may also be
involved in hypersensitive cell death by acting as an inhibitor of catalase, an enzyme that
removes hydrogen peroxide. 4) Hydrogen peroxide may trigger production of additional
defense compounds such as phytoalexins, antimicrobial low molecular weight compounds.
For a review on the role of the oxidative burst and SA please see Lamb, C. and Dixon,
R.A., Ann. Rev. Plant Physiol. Plant Mol. Biol. 48: 251-275 (1997).

Detoxification of Harvested Grain, Silage, or Contaminated Food Crop 

The present invention also relates to a method of detoxifying a fumonisin or a
structurally related mycotoxin with an APAO enzyme during the processing of grain for
animal or human food consumption, during the processing of plant material for silage, or
in food crops contaminated with a toxin producing microbe, such as but not limited to,
tomato. Since the atmospheric ammoniation of corn has proven to be an ineffective
method of detoxification (see B. Fitch Haumann, INFORM 6:248-257 (1995)), such a
methodology during processing is particularly critical where transgenic detoxification is
not applicable.

In one embodiment of the present invention, fumonisin degradative enzymes are 
presented to grain, plant material for silage, or a contaminated food crop, or during the 
processing procedure, at the appropriate stages of the procedure and in amounts effective 
for detoxification of fumonisins and structurally related mycotoxins. Detoxification by the
enzymes, microbial strains, or an engineered microorganism can occur not only during the
processing, but also any time prior to or during the feeding of the grain or plant material to an
animal or incorporation of the grain or food crop into a human food product, or before or 
during ingestion of the food crop. 

Another embodiment of the present invention is the engineering of a bacterium or
fungus to express the detoxification enzymes and then using the bacterium or fungus rather
than the enzyme itself. There are a number of microbes that could be engineered to 
express the polynucleotides of the present invention. One could also activate, either 
inducibly or constitutively, the endogenous genes for fumonisin esterase or APAO. By 
overexpressing the degradative enzymes and then treating plants, seed, or silage with the
microorganism, it would be possible to degrade fumonisin in situ.

The polynucleotides of the invention can be introduced into microorganisms that 
multiply on plants (epiphytes) to deliver enzymes to potential target crops. Epiphytes can 
be gram-positive or gram-negative bacteria, for example. 

The microorganisms that have been genetically altered to contain at least one
degradative polynucleotide and resulting polypeptide may be used for protecting
agricultural crops and products. In one aspect of the invention, whole, i.e. unlysed, cells of 
the transformed organism are treated with reagents that prolong the activity of the enzyme 
produced in the cell when the cell is applied to the environment of a target plant. A 
secretion leader may be used in combination with the gene of interest such that the
resulting enzyme is secreted outside the host cell for presentation to the target plant.

The degradative enzymes can be fermented in a bacterial host and the resulting
bacteria processed and used as a microbial spray. Any suitable microorganism can be used
for this purpose. See, for example, Gaertner, et al. (1993) in Advanced Engineered
Pesticides, (ed. Kim, Marcel Dekker, New York).

The enzymes or microorganisms can be introduced during processing in
appropriate manners, for example as a wash or spray, or in dried or lyophilized form or
powdered form, depending upon the nature of the milling process and/or the stage of
processing at which the enzymatic treatment is carried out. See generally, Hoseney, R.C.,
Principles of Cereal Science and Technology, American Assn. of Cereal Chemists, Inc.,
1990 (especially Chapters 5, 6 and 7); Jones, J.M., Food Safety, Eagan Press, St. Paul,
MN, 1992 (especially Chapters 7 and 9); and Jelen, P., Introduction to Food Processing,
Reston Publ. Co., Reston, VA, 1985. Processed grain or silage to be used for animal feed
can be treated with an effective amount of the enzymes in the form of an inoculant or
probiotic additive, for example, or in any form recognized by those skilled in the art for
use in animal feed. The enzymes of the present invention are expected to be particularly
useful in detoxification during processing and/or in animal feed prior to its use, since the
enzymes display relatively broad ranges of pH activity. The esterase from Exophiala
spinifera, ATCC 74269, showed a range of activity from about pH 3 to about pH 6, and the
esterase from the bacterium of ATCC 55552 showed a range of activity from about pH 6 to
about pH 9 (US patent no. 5,716,820, supra). The APAO enzyme from Exophiala
spinifera (ATCC 74269) has a pH range of activity from pH 6 to pH 9.
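
The pH ranges quoted above can be compared directly when deciding which enzymes might be combined at a given processing step. The Python sketch below is only an illustration: the dictionary keys and function names are hypothetical, and the "about pH" limits from the text are treated as hard, inclusive bounds, which is a simplifying assumption.

```python
# Approximate pH activity ranges as quoted in the text (treated as inclusive bounds).
PH_RANGES = {
    "esterase (E. spinifera, ATCC 74269)": (3.0, 6.0),
    "esterase (bacterium, ATCC 55552)": (6.0, 9.0),
    "APAO (E. spinifera, ATCC 74269)": (6.0, 9.0),
}


def active_enzymes(ph):
    """Names of the enzymes whose quoted activity range includes the given pH."""
    return sorted(name for name, (low, high) in PH_RANGES.items() if low <= ph <= high)


def shared_window(names):
    """pH window in which all named enzymes are active, or None if there is none."""
    low = max(PH_RANGES[name][0] for name in names)
    high = min(PH_RANGES[name][1] for name in names)
    return (low, high) if low <= high else None


print(active_enzymes(7.5))
print(shared_window(["esterase (E. spinifera, ATCC 74269)",
                     "APAO (E. spinifera, ATCC 74269)"]))
```

On these figures the Exophiala esterase and APAO overlap only near pH 6, whereas the bacterial esterase and APAO share the whole range from about pH 6 to about pH 9.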


Genetic Engineering of Ruminant Microorganisms 

Ruminant microorganisms can be genetically engineered to contain and express 
either the fumonisin esterase enzymes or APAO, or a combination of the enzymes. The 
genetic engineering of microorganisms is now an art recognized technique, and ruminant 
microorganisms so engineered can be added to feed in any art recognized manner, for
example as a probiotic or inoculant. In addition, microorganisms capable of functioning as 
bioreactors can be engineered so as to be capable of mass producing either the fumonisin 
esterases or the APAO enzyme.

Use of the Fumonisin Esterase and APAO Enzymes as Detection Reagents for
Fumonisins and Related Compounds

Another embodiment of the present invention is the use of the enzymes of the
present invention as detection reagents for fumonisins and related compounds. The
enzymes of the present invention can be used as detection reagents because of the high
specificity of the esterase and deaminase enzymes, and the fact that hydrolysis followed by
amine oxidation can be monitored by detection of hydrogen peroxide or ammonia using
standard reagents (analogous to a glucose detection assay using glucose oxidase).
Hydrogen peroxide is often measured by linking a hydrogen peroxide-dependent
peroxidase reaction to a colored or otherwise detectable peroxidase product (e.g.
Demmano, et al., European Journal of Biochemistry 238(3): 785-789 (1996)). Ammonia
can be measured using ion-specific electrodes (Fritsche, et al., Analytica Chimica Acta
244(2): 179-182 (1991); West et al., Analytical Chemistry 64(5): 533-540 (1992), and all
herein incorporated by reference) or by GC or other chromatographic method.

For example, recombinant or non-recombinant, active fumonisin esterase (ESP1 or
BEST1) and APAO proteins are added in catalytic amounts to a sample tube containing an
unknown amount of fumonisins (FB1, FB2, FB3, FB4, or partial or complete hydrolysis
products of these). The tube is incubated under pH and temperature conditions sufficient to
convert any fumonisin in the sample to AP1 or to 2-oxo-FB1, and correspondingly the
AP1 to 2-OP, ammonia, and hydrogen peroxide. Alternatively, APAO or trAPAO is added
in catalytic amounts to a sample tube containing an unknown amount of fumonisins (FB1,
FB2, FB3, FB4, or partial or complete hydrolysis products of these). The tube is incubated
under pH and temperature conditions sufficient to convert any fumonisin in the sample to
2-oxo-FB1, ammonia, and hydrogen peroxide. Then suitable reagents are added for
quantification of the hydrogen peroxide or ammonia that was generated stoichiometrically
from fumonisins. By comparison with control tubes that received no esterase or APAO
enzyme, the amount of fumonisin present can be calculated in direct molar proportion to
the hydrogen peroxide or ammonia detected, relative to a standard curve.
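
The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not a validated assay protocol: it assumes a linear standard curve of signal against nmol hydrogen peroxide, a 1:1 molar ratio between fumonisin oxidized and hydrogen peroxide formed, and made-up standard readings. All function and variable names are hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for a straight-line standard curve."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def fumonisin_nmol(sample_signal, control_signal, slope, intercept):
    """nmol fumonisin in the tube, assuming 1 mol H2O2 per mol fumonisin oxidized.

    The control tube (no esterase or APAO) is read off the same curve and
    subtracted, so that only enzyme-dependent H2O2 is counted.
    """
    h2o2_sample = (sample_signal - intercept) / slope
    h2o2_control = (control_signal - intercept) / slope
    return max(h2o2_sample - h2o2_control, 0.0)


# Illustrative standards only: nmol H2O2 per tube and the corresponding readings.
standards_nmol = [0.0, 5.0, 10.0, 20.0]
standards_signal = [0.02, 0.12, 0.22, 0.42]

slope, intercept = fit_line(standards_nmol, standards_signal)
estimate = fumonisin_nmol(0.32, 0.04, slope, intercept)
print(f"estimated fumonisin: {estimate:.1f} nmol")
```

The same arithmetic would apply to ammonia detection, with the standard curve built from ammonia standards instead of hydrogen peroxide.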

This invention can be better understood by reference to the following non-limiting
examples. It will be appreciated by those skilled in the art that other embodiments of the
invention may be practiced without departing from the spirit and the scope of the invention
as herein disclosed and claimed.

Example 1

Fungal and bacterial isolates. Exophiala isolates from maize were isolated as
described in US patent no. 5,716,820, issued February 10, 1998; US patent no. 6,025,188, 
issued February 15, 2000; and pending US application no. 08/888,950, filed July 7, 1997, 
and herein incorporated by reference. 

Isolation methods. Direct isolation of black yeasts from seed was accomplished by 
plating 100 microliters of seed wash fluid onto YPD or Sabouraud agar augmented with 
cycloheximide (500 mg/liter) and chloramphenicol (50 mg/liter). Plates were incubated at 
room temperature for 7-14 days, and individual pigmented colonies that arose were 
counted and cultured for analysis of fumonisin-degrading ability as described in US patent 
no. 5,716,820, issued February 10, 1998; US patent no. 6,025,188, issued February 15, 
2000; and pending US application no. 08/888,950, filed July 7, 1997. 

Analysis of fumonisins and metabolism products. Analytical thin-layer
chromatography was carried out on 100% silanized C18 silica plates (Sigma #T-7020; 10 x
10 cm; 0.1 mm thick) by a modification of the published method of Rottinghaus
(Rottinghaus, et al., J Vet Diagn Invest, 4: 326 (1992), and herein incorporated by
reference).

To analyze fumonisin esterase activity, sample lanes were pre-wet with methanol to
facilitate sample application. After application of from 0.1 to 2 µl of aqueous sample, the
plates were air-dried and developed in MeOH:4% KCl (3:2) or MeOH:0.2 M KOH (3:2)
and then sprayed successively with 0.1 M sodium borate (pH 9.5) and fluorescamine (0.4
mg/ml in acetonitrile). Plates were air-dried and viewed under long wave UV.

For analysis of APAO activity, an alternative method was used. Equal volumes of
sample and 14C-AP1 (1 mg/ml, pH 8, 50 mM sodium phosphate) were incubated at room
temperature for one to six days. Analytical thin-layer chromatography was then carried out
on C60 HPK silica gel plates (Whatman #4807-700; 10 x 10 cm; 0.2 mm thick). After
application of from 0.1 to 2 µl of aqueous sample, the plates were air-dried and developed
in CHCl3:MeOH:CH3COOH:H2O (55:36:8:1). Plates were then air-dried and exposed to a
PhosphorImager screen (Molecular Dynamics) or autoradiographic film. A Storm™
PhosphorImager (Molecular Dynamics) was used to scan the image produced on the
screen.

Alkaline hydrolysis of FB1 to AP1. FB1 or crude fumonisin C8 material was
suspended in water at 10-100 mg/ml and added to an equal volume of 4 N NaOH in a
screw-cap tube. The tube was sealed and incubated at 60°C for 1 hr. The hydrolysate was
cooled to RT and mixed with an equal volume of ethyl acetate, centrifuged at 1000 RCF
for 5 minutes, and the organic (upper) layer recovered. The pooled ethyl acetate layers from
two successive extractions were dried under N2 and resuspended in distilled H2O. The
resulting material (the aminopentol of FB1 or "AP1") was analyzed by TLC.
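
As a rough check on the hydrolysis conditions above, the following Python sketch works out the concentrations after the 1:1 mixing step and the molar excess of hydroxide over ester bonds. Two values are assumptions not given in the text: a molecular weight of about 721.8 g/mol for FB1 and two tricarballylic ester linkages per FB1 molecule. The estimate applies to purified FB1 only, since crude material contains an unknown fraction of fumonisin.

```python
FB1_MW = 721.8        # g/mol, assumed literature value for fumonisin B1
ESTERS_PER_FB1 = 2    # assumed: two tricarballylic ester linkages per molecule


def hydrolysis_mix(fb1_mg_per_ml, naoh_normality=4.0):
    """Concentrations after adding the FB1 suspension to an equal volume of NaOH.

    Returns (FB1 in mg/ml, NaOH normality, hydroxide:ester molar ratio).
    """
    fb1_final = fb1_mg_per_ml / 2.0                      # 1:1 dilution
    naoh_final = naoh_normality / 2.0                    # mol hydroxide per liter
    ester_molar = ESTERS_PER_FB1 * fb1_final / FB1_MW    # mg/ml equals g/l
    return fb1_final, naoh_final, naoh_final / ester_molar


for start in (10.0, 100.0):
    fb1, naoh, excess = hydrolysis_mix(start)
    print(f"{start:5.0f} mg/ml -> {fb1:.0f} mg/ml FB1, {naoh:.0f} N NaOH, "
          f"{excess:.0f}-fold hydroxide excess")
```

Even at the upper end of the stated range the hydroxide remains in clear molar excess over the ester bonds under these assumptions.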

Enzyme activity of culture filtrate and mycelium. Exophiala spinifera isolate
2141.10 was grown on YPD agar for 1 week, and conidia were harvested, suspended in
sterile water, and used at 10^5 conidia per ml to inoculate sterile Fries mineral salts medium
containing 1 mg/ml purified FB1 (Sigma Chemical Co.). After 2 weeks incubation at 28°C
in the dark, cultures were filtered through 0.45 micron cellulose acetate filters, and
rinsed with Fries mineral salts. Fungal mycelium was suspended in 15 mL of 0.1% FB1,
pH 5.2 + 1 mM EDTA + 3 µg/mL Pepstatin A + 1.5 µg/mL Leupeptin and disrupted in a
Bead Beater™ using 0.1 mm beads and one minute pulses, with ice cooling. Hyphal
pieces were collected by filtering through Spin X™ (0.22 µm), and both mycelial
supernatant and original culture filtrates were assayed for fumonisin modification by
methods outlined above.

Preparation of crude culture filtrate. Agar cultures grown as above were used to
inoculate YPD broth cultures (500 ml) in conical flasks at a final concentration of 10^5
conidia per ml culture. Cultures were incubated 5 days at 28°C without agitation and
mycelia harvested by filtration through 0.45 micron filters under vacuum. The filtrate was
discarded and the mycelial mat was washed and resuspended in sterile carbon-free, low
mineral salts medium (1 g/liter NH4NO3; 1 g/liter NaH2PO4; 0.5 g/liter MgCl2; 0.1 g/liter
NaCl; 0.13 g/liter CaCl2; 0.02 g/liter FeSO4·7H2O, pH 4.5) containing 0.5 mg/ml alkaline
hydrolyzed crude FB1. After 3-5 days at 28°C in the dark with no agitation the cultures
were filtered through low protein binding 0.45 micron filters to recover the culture filtrate.
Phenylmethyl sulfonyl fluoride (PMSF) was added to a concentration of 2.5 mM and the
culture filtrate was concentrated using an Amicon™ YM10 membrane in a stirred cell at
room temperature, and resuspended in 50 mM sodium acetate, pH 5.2 containing 10 mM
CaCl2. The crude culture filtrate (approx. 200-fold concentrated) was stored at -20°C.

To obtain preparative amounts of enzyme-hydrolyzed fumonisin, 10 mg of FB1
(Sigma) was dissolved in 20 mL of 50 mM sodium acetate at pH 5.2 + 10 mM CaCl2, and
0.25 mL of 200x concentrated crude culture filtrate of 2141.10 was added. The solution
was incubated at 37°C for 14 hours, and then cooled to room temperature. The reaction
mixture was brought to approx. pH 9.5 by addition of 0.4 mL of 4 N KOH, and the mixture
was extracted twice with 10 mL ethyl acetate. The combined organic layers were dried
under N2 and resuspended in dH2O. 2.5 milligrams of organic extracted material were
analyzed by Fast Atom Bombardment (FAB) mass spectrometry. The resulting mass
spectrum showed a major ion at M/z (+1)=406 mass units, indicating the major product of
enzymatic hydrolysis was AP1, which has a calculated molecular weight of 405.
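
The mass assignment above can be reproduced with a short calculation. The sketch below assumes a molecular formula of C22H47NO5 for AP1, which is not stated in the text but is consistent with the quoted molecular weight of 405, and uses standard monoisotopic atomic masses. The function name is hypothetical.

```python
import re

# Monoisotopic atomic masses (u) for the elements needed here.
MONOISOTOPIC = {"C": 12.0, "H": 1.007825, "N": 14.003074, "O": 15.994915}
PROTON_MASS = 1.007276

AP1_FORMULA = "C22H47NO5"  # assumed formula for AP1, not given in the text


def monoisotopic_mass(formula):
    """Sum atomic masses for a simple formula such as 'C22H47NO5' (no brackets)."""
    total = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += MONOISOTOPIC[element] * (int(count) if count else 1)
    return total


ap1_mass = monoisotopic_mass(AP1_FORMULA)
protonated = ap1_mass + PROTON_MASS  # the (M+1) ion seen in positive-ion FAB

print(f"M = {ap1_mass:.2f}, M+1 = {protonated:.2f}")
```

Rounded to nominal mass, the protonated ion comes out at 406 and the neutral molecule at 405, in line with the values reported above.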



Example 2 

Preparation of AP1-induced and non-induced mycelium.

Liquid cultures of Exophiala spinifera isolate 2141.10 were prepared from YPD
agar plates (Yeast Extract 10 gm, Bacto-Peptone 20 gm, Dextrose 0.5 gm, and Bacto-Agar
15 gm per liter of water). Aliquots (400-500 µL) of a water suspension of E. spinifera cells
from YPD agar were spread uniformly onto 150 x 15 mm YPD agar plates with 4 mm
sterile glass beads. The plates were incubated at room temperature for 6-7 days. The
mycelia/conidia were transferred from the agar plates into Mineral Salts Medium (MSM)
(Na2HPO4·7H2O 0.2 gm, NH4Cl 1.0 gm, CaCl2·2H2O 0.01 gm, FeSO4·7H2O 0.02 gm per
liter of distilled water, pH 4.5) and centrifuged at 5000 x g, 4°C, 20 minutes to pellet the
cells. The cell pellet was rinsed once in 40 ml MSM and recentrifuged. The rinsed cell
pellet was used to inoculate MSM at a 1:19 ratio of packed cells:MSM. The culture to be
induced was supplemented with AP1 to a final concentration of 0.5-1.0 mg/ml and
incubated at 28°C, 100 rpm, in the dark to induce catabolic enzymes. The non-induced
cultures did not receive AP1 but were grown on media containing 4-ABA at the same
concentration as AP1. The supernatants were removed by filtration through 0.45 micron
cellulose acetate. The remaining mycelial mat was washed with sterile MSM and then
frozen in liquid nitrogen for storage.
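
The inoculation ratio and AP1 supplement in this example translate into volumes and masses as sketched below. This is only an illustrative helper with a hypothetical name, and it assumes that the stated final AP1 concentration refers to the total culture volume (packed cells plus MSM).

```python
def induction_culture(packed_cells_ml, ap1_mg_per_ml=0.5):
    """Volumes and AP1 mass for one induced culture at 1:19 packed cells:MSM."""
    if not 0.5 <= ap1_mg_per_ml <= 1.0:
        raise ValueError("the example uses 0.5-1.0 mg/ml AP1")
    msm_ml = 19.0 * packed_cells_ml           # 19 parts MSM per part packed cells
    total_ml = packed_cells_ml + msm_ml
    return {
        "msm_ml": msm_ml,
        "total_ml": total_ml,
        "ap1_mg": ap1_mg_per_ml * total_ml,   # assumes concentration is per total volume
    }


plan = induction_culture(5.0, ap1_mg_per_ml=1.0)
print(plan)
```

For a non-induced control the same volumes would apply, with 4-ABA added in place of AP1 at the same concentration.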
Example 3

Effect of FB1 and AP1 on maize coleoptiles

Maize coleoptiles from 4 day dark-grown germinated maize seeds were excised
above the growing point and placed in 96-well microtiter plates in the presence of 60
microliters of sterile distilled water containing FB1 or AP1 at approximately equimolar
concentrations of 1.5, .5, .15, .05, .015, .005, .0015, or .0005 millimolar, along with water
controls. After 2 days in the dark at 28°C the coleoptiles were placed in the light and
incubated another 3 days. Injury or lack thereof was evaluated as follows:

+ = brown necrotic discoloration of coleoptile
- = no symptoms (same as water control)
The results (see table above) indicate there is at least a 30-fold difference in toxicity
between FB1 and AP1 to maize coleoptiles of this genotype. This is in general agreement
with other studies where the toxicity of the two compounds was compared for plant
tissues: In Lemna tissues, AP1 was approx. 40-fold less toxic (Vesonder et al., Arch
Environ Contam Toxicol 23: 464-467 (1992)). Studies with both AAL toxin and FB1 in
tomato also indicate the hydrolyzed version of the molecule is much less toxic (Gilchrist et
al., Mycopathologia 117: 57-64 (1992)). Lamprecht et al. also observed an approximate
100-fold reduction in toxicity to tomato by AP1 versus FB1 (Lamprecht et al.,
Phytopathology 84: 383-391 (1994)).

Example 4 

Effect of FB1 and AP1 on maize tissue cultured cells (Black Mexican Sweet, BMS)

FB1 or AP1 at various concentrations was added to suspensions of BMS cells
growing in liquid culture medium in 96-well polystyrene plates. After 1 week the cell
density in wells was observed under low power magnification and growth of toxin-treated
wells was compared to control wells that received water. Growth of BMS cells was
significantly inhibited at 0.4 micromolar FB1, but no inhibition was observed until 40
micromolar AP1. This represents an approximate 100-fold difference in toxicity to maize
tissue cultured cells. Similarly Van Asch et al. (Van Asch et al., Phytopathology 82:
1330-1332 (1992)) observed significant inhibition of maize callus grown on solid medium
at 1.4 micromolar FB1. AP1 was not tested in that study, however.

Example 5 

APAO Activity

A cell-free extract that contains the deaminase activity was obtained by subjecting
substrate-induced Exophiala spinifera cells to disruption using a Bead Beater™ in 50 mM
Na-phosphate, pH 8.0, and recovering the cell-free supernatant by centrifugation and 0.45
micron filtration. Catabolic activity is assayed by incubating the extract with AP1
(hydrolyzed fumonisin B1 backbone) or 14C-labelled AP1 and evaluating
by TLC on C18 or C60 silica. The product 2-OP has a lower Rf than AP1 and is detected
either by radiolabel scan or by H2SO4 spray/charring of the TLC plate. 2-OP does not react
with the amine reagent, fluorescamine, that is routinely used to detect AP1 on TLC plates,
suggesting that the amine group is missing or chemically modified. Activity is greater at
37°C than at room temperature, but no AP1 catabolic activity remained following 30 min.
at 65°C or 100°C. Activity is maximal at pH 9. At pH 9, complete conversion to 2-OP
occurred in 30 minutes. Activity is retained by 30,000 dalton molecular weight cutoff
membrane, but only partially retained by 100,000 dalton molecular weight cutoff
membrane. Other amine-containing substrates were tested for modification by the crude
extract. Fumonisin, with tricarballylic acids attached, is not modified by the extract,
indicating that ester-hydrolysis must occur first for the APAO to be able to be effective in
modifying FB1 (as noted below, the E. coli-expressed, recombinant APAO enzyme does in
fact oxidize FB1 although at a lower rate than AP1). Other long-chain bases (sphingosine,
sphinganine, and phytosphingosine) are apparently not modified by the crude APAO,
suggesting the enzyme(s) is specific for the fumonisin backbone. Preparative amounts of
the product, named 2-OP, have also been purified and analyzed by C13 nmr. The results
indicate that 2-OP has a keto group at carbon 2 instead of an amine, consistent with an
oxidative deamination by an amine oxidase. The C13 nmr data also indicate that 2-OP
spontaneously forms an internal hemiketal between C-1 and C-5, resulting in a
5-membered ring with a new chiral center at C-2. All other carbon assignments are as in
AP1, thus 2-OP is a compound of composition C22H44O6, FW 404. The product of the
enzyme acting on hydrolyzed fumonisin would not be expected to display any significant
toxicity.
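
The oxidative deamination described in this example can be checked for atom balance. The Python sketch below assumes the general amine oxidase reaction (amine + O2 + H2O giving ketone + NH3 + H2O2) and the molecular formulas C22H47NO5 for AP1 and C22H44O6 for 2-OP. Those formulas are assumptions chosen to be consistent with the molecular weights of 405 and 404 quoted in this document. Function names are hypothetical.

```python
import re
from collections import Counter


def atoms(formula):
    """Element counts for a simple formula such as 'C22H44O6' (no brackets)."""
    counts = Counter()
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] += int(number) if number else 1
    return counts


def side(formulas):
    """Total element counts for one side of a reaction (one molecule of each)."""
    total = Counter()
    for formula in formulas:
        total += atoms(formula)
    return total


# AP1 + O2 + H2O -> 2-OP + NH3 + H2O2 (assumed formulas, see lead-in)
reactants = side(["C22H47NO5", "O2", "H2O"])
products = side(["C22H44O6", "NH3", "H2O2"])
balanced = reactants == products

print(dict(reactants), dict(products), balanced)
```

With these formulas the equation balances, giving one hydrogen peroxide and one ammonia per molecule of AP1 oxidized, which matches the stoichiometric relationship relied on elsewhere in this document.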

Other enzymes were tested for their ability to modify AP1. All enzymes were
assayed by radiolabeled TLC, as described above, under optimal conditions at 37°C,
overnight or longer. The results are as follows:



Deaminating                  EC         Source                              Result
Monoamine Oxidase            1.4.3.4    bovine plasma                       negative
D-amino oxidase              1.4.3.3    porcine kidney; Type X              negative
L-amino oxidase              1.4.3.2    C. adamanteus venom; Type I         negative
Tyramine oxidase             1.4.3.4    Arthrobacter spp                    negative
Methylamine dehydrogenase    1.4.99.3   Paracoccus denitrificans            negative
Aralkyl amine dehydrogenase  1.4.99.4   Alcaligenes faecalis                negative
Phenylalanine ammonia lyase  4.3.1.5    Rhodotorula glutinis; Type I        negative
Histidine ammonia lyase      4.3.1.3    Pseudomonas fluorescens             negative
L-aspartase                  4.3.1.1    Hafnia alvei (Bacterium cadaveris)  negative
Tyrosine oxidase             1.14.18.1  mushroom                            negative
Lysine oxidase               1.4.3.14   Trichoderma viride                  negative
Diamine oxidase              1.4.3.6    porcine kidney                      negative



The results were negative for each enzyme tested. Therefore isolates from the
American Type Culture Collection (ATCC) were collected. The ATCC isolates selected
were listed as containing amine-modifying enzymes or were capable of growth/utilization
on amine-containing substrates. The isolates were tested to determine if they could grow
on or utilize AP1 as the sole carbon source and if any could modify AP1 to a new
compound(s). The nitrogen sources that were used in liquid cultures were AP1 0.1%
(w/v), s-butylamine 0.1% (v/v), n-butylamine 0.1% (v/v), and ammonium nitrate 0.2%
(w/v). These were prepared in Vogel's Minimal Media (without NH4NO3) containing 2%
sucrose. The isolates were inoculated into the various media and monitored for growth
over 2-3 weeks. They were also assayed with the 14C-radiolabeled TLC assay for AP1
modification. In summary, none of the isolates tested exhibited modification of AP1 in
vivo. Clearly the APAO enzyme is unique and unusual in its ability to modify the AP1
toxin.

EXAMPLE 6 
Isolation of the trAPAO Polynucleotide 

The trAPAO polynucleotide was identified using a proprietary transcript imaging
method that compares transcript patterns in two samples and allows cloning of
differentially expressed fragments. This technology was developed by CuraGen® (New
Haven, CT) (see published PCT patent application no. WO 97/15690, published May 1,
1997, and hereby incorporated by reference). Fluorescently-tagged, PCR-amplified cDNA
fragments representing expressed transcripts can be visualized as bands or peaks on a gel
tracing, and the cDNA from differentially expressed (induced or suppressed) bands can be
recovered from a duplicate gel, cloned and sequenced. Known cDNAs can be identified
without the need for cloning, by matching the predicted size and partially known sequence
of specific bands on the tracing.

In the present invention two RNA samples were obtained from cultures of E. spinifera
grown for a specified period in a mineral salts medium containing either AP1 (induced
condition) or gamma-aminobutyric acid (ABA; non-induced condition) as a sole carbon
source. In the induced condition, fumonisin esterase and APAO enzyme activities are
detected, whereas in the non-induced condition these activities are not detected. The
methods used for induction of APAO and detection of activity are described earlier (see
Example 2 and Example 5). RNA was extracted from induced mycelium by Tri-Reagent
methods (Molecular Research Center Inc., Cincinnati, Ohio), modified only by grinding a
frozen slurry of tissue and Tri-Reagent with a mortar and pestle until almost melted and
by adding an additional extraction after the phase separation, in which the aqueous phase
was extracted one time with phenol and two times with a phenol:chloroform:isoamyl
alcohol mixture. The RNAs were submitted for CuraGen® transcript imaging to detect cDNA
fragments that are induced specifically in the presence of AP1. In the resulting gel
tracing several bands were found which showed induction of at least 2-fold and up to
79-fold or even 100-fold or more in AP1. Several bands showed induction of at least
10-fold in AP1-grown cells as compared to cells grown in ABA. The sequence of two
highly induced bands can be found in Table 1.

TABLE 1

Nucleotide sequence of two CuraGen® bands that were identified as strongly induced
by AP1 in cultures of Exophiala spinifera.

>k0n0-395.5_b (SEQ ID NO: 1)

GGGCCCCGGCGTTCTCGTAGGCTGCGCGGAGTTGGTCCCAGACAGACTTTTGTCGTACCTGCTTG 
GACTGTTGGGACCACTTCCGTCCCGGGTCTCCGACCATGAAACAGGTAATGGACCATTGTCGAT 
CGACGTCGATGCTGGTATCTCTGGCAAATGAGATGGGGTCACAGCTCGATTGGAGGACGCCCGA 
GAAGCCTTGTTCGCGCCACCACGGCTTGTCCCATACGAAGACTATCTTGCTATAGTAGCCCAGG 

ATAGAATTTTCCGCCAATGCTTGCTTCTCGGCGGGAAGAGGTGGTGAAAATGTCAAGGTGGGAT
ACAAGGTTGTCGGTAACGAAACCANCACCTTTTTGCTTCGGAACACGGCGC

>r0c0-182.3_6 (SEQ ID NO: 2)

GAATTTTCCGCCAATGCTTGCTTCTCGGCGGGAAGAGGTGGTGAAAATGTCAAGGTGGGATACA 
AGGTTGTCGGTAACGAAACCACCACCTTTTTGCTTCGGAACACGGCGCCCGAGGCCGATCGTAC 
TGTACAGCCGGATGCCGACTGCTCAATTTCAGCGACGGGGGTGTTGAGGTGCAC 



Two of the highly induced bands, k0n0-395.5 and r0c0-182.3, showed significant
sequence homology to a family of enzymes, flavin-containing amine oxidases (EC
1.4.3.4), that oxidize primary amines to an aldehyde or ketone, releasing ammonia and
hydrogen peroxide (Table 2).



TABLE 2

Identification of a putative flavin amine oxidase from E. spinifera: AP1-induced
transcript fragments with amine oxidase homology. BLAST 2.0 default parameters.

Clone ID              Size     Best Hit   Best Hit Name, source        Prob      from   to    Likely function
k0n0-395.5            395 bp   P40974     putrescine oxidase,          8.0e-07   276    333   oxidation of C-2
                                          Micrococcus rubens,                                 amine of AP1
                                          EC 1.4.3.10
                                          Length = 478
r0c0-182.3 (contigs   182 bp   P12398     monoamine oxidase type       0.0039    238    296   oxidation of C-2
with k0n0-395)                            A (MAO-A) [Bos taurus]                              amine of AP1
                                          Length = 527



The chemical structure of the primary product of AP1 deamination is thought to be a
2-keto compound which cyclizes to a hemiketal at carbons 2 and 5. Therefore it is predicted
that this induced enzyme is responsible for deamination of AP1.
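
Table 2 notes that the second band forms a contig with the first. A minimal sketch of how such a suffix-to-prefix overlap might be detected is given here, treating the ambiguity code N as matching any base. The function names, the minimum overlap of 20 bases, and the toy sequences are illustrative assumptions, not the procedure actually used to assemble the bands.

```python
def bases_match(a, b):
    """Two base calls agree if they are identical or either is the ambiguity code N."""
    return a == b or a == "N" or b == "N"


def suffix_prefix_overlap(left, right, min_len=20):
    """Length of the longest suffix of `left` that matches a prefix of `right`.

    Returns 0 when no overlap of at least `min_len` bases is found.
    """
    for length in range(min(len(left), len(right)), min_len - 1, -1):
        suffix = left[-length:]
        prefix = right[:length]
        if all(bases_match(x, y) for x, y in zip(suffix, prefix)):
            return length
    return 0


def merge_fragments(left, right, min_len=20):
    """Join two fragments across their overlap, or return None if they do not overlap."""
    overlap = suffix_prefix_overlap(left, right, min_len)
    if overlap == 0:
        return None
    return left + right[overlap:]


if __name__ == "__main__":
    # Toy fragments for illustration; they are not the sequences of Table 1.
    toy_left = "TTGACCGGTAACGAAACCANCACCTTTTTGC"
    toy_right = "GGTAACGAAACCACCACCTTTTTGCAAGGTC"
    print(suffix_prefix_overlap(toy_left, toy_right))
    print(merge_fragments(toy_left, toy_right))
```

Scanning from the longest candidate downward returns the maximal overlap first, which is the usual choice when only two fragments are being joined.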

Using sequence derived from k0n0-395.5, a partial cDNA was obtained by 3' and 5'
RACE-PCR (Chenchik, et al., CLONTECHniques X 1:5-8 (1995); Chenchik, et al., A new
method for full-length cDNA cloning by PCR. In A Laboratory Guide to RNA: Isolation,
Analysis, and Synthesis. Ed. Krieg, P.A. (Wiley-Liss, Inc.), 273-321 (1996)). A RACE
cloning kit from CLONTECH was used to obtain the RACE amplicons. Briefly, poly A+ RNA
is transcribed to make first strand cDNA using a "lock-docking" poly T cDNA synthesis
primer, the second strand is synthesized, and the Marathon cDNA adaptor is ligated to
both ends of the ds cDNA. Diluted template is then used with the Marathon adapter primer
and, in separate reactions, either a 5' Gene Specific Primer (GSP) or a 3' GSP to produce
the 3' or 5' RACE amplicon. After characterization of the RACE product(s) and sequencing,
full-length cDNAs may be generated by 1) end-to-end PCR using distal 5' and 3' GSPs with
the adapter-ligated ds cDNA as template, or 2) digesting the cloned 5' and 3' RACE
fragments with a restriction enzyme that cuts uniquely in the region of overlap, then
isolating and ligating the fragments. Subsequently, the RACE-generated full-length cDNAs
from 1) and 2) may be cloned into a suitable vector.

In combination with the supplied adapter primer the following gene specific primers
were used: for 3' RACE, the oligonucleotide N21965:
5'-TGGTTTCGTTACCGACAACCTTGTATCCC-3' (SEQ ID NO: 3) and for 5' RACE, the
oligonucleotide N21968: 5'-GAGTTGGTCCCAGACAGACTTTTGTCGT-3' (SEQ ID
NO: 4). The polynucleotide sequence of the trAPAO polynucleotide, k0n0-395_6.5, from
Exophiala spinifera is shown in SEQ ID NO: 5. The polypeptide sequence of trAPAO is
shown in SEQ ID NO: 6.
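
As an illustration of how gene specific primers relate to the band sequence from which they were designed, the sketch below searches a template for a primer on either strand. The helper names are hypothetical, and the two template strings are short excerpts of SEQ ID NO: 1 as printed in Table 1, not the full band; the strand on which each primer is found is an observation from these excerpts, not a statement from the text.

```python
# Complement table covering the four bases plus the ambiguity code N.
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def locate_primer(primer, template):
    """Return (strand, 0-based start) of the first exact match, or None.

    "+" means the primer matches the template as written; "-" means its
    reverse complement matches, i.e. the primer anneals to the written strand.
    """
    position = template.find(primer)
    if position != -1:
        return ("+", position)
    position = template.find(reverse_complement(primer))
    if position != -1:
        return ("-", position)
    return None


# Primer sequences as given for SEQ ID NO: 3 and SEQ ID NO: 4.
N21965 = "TGGTTTCGTTACCGACAACCTTGTATCCC"
N21968 = "GAGTTGGTCCCAGACAGACTTTTGTCGT"

# Short excerpts of SEQ ID NO: 1 (illustrative; not the complete band).
EXCERPT_NEAR_START = "GGCTGCGCGGAGTTGGTCCCAGACAGACTTTTGTCGTACCTGCTTG"
EXCERPT_NEAR_END = "GTCAAGGTGGGATACAAGGTTGTCGGTAACGAAACCANCACC"

if __name__ == "__main__":
    print("N21968:", locate_primer(N21968, EXCERPT_NEAR_START))
    print("N21965:", locate_primer(N21965, EXCERPT_NEAR_END))
```

An exact-match search is sufficient here because both primers were designed directly from the band; a tolerant search would be needed for degenerate primers.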

A second clone of APAO containing an unspliced intron was also found. The
polynucleotide sequence of the trAPAO-I polynucleotide, k0n0-395_5.4, the
intron-containing clone, from Exophiala spinifera, can be found in SEQ ID NO: 7. The
polypeptide sequence of trAPAO-I with the intron spliced out is shown in SEQ ID NO: 8.
The polypeptide sequence of trAPAO-I without the intron spliced out is shown in SEQ ID
NO: 9.

EXAMPLE 7 
Heterologous Expression of trAPAO 

Protein alignments generated with PileUp (GCG) indicate that k0n0-395_6.5
(trAPAO) is similar in size to other flavin amine oxidases and is close to being full length
with respect to the amino terminus of this class of proteins. The k0n0-395_6.5 sequence
contains a complete beta-alpha-beta fold that is required for dinucleotide (FAD) binding,
close to the amino end. The k0n0-395 sequence appears to lack only a variable amino
terminal segment that varies in length from 5 amino acids in rat monoamine oxidases A & B
to 40 amino acids in Aspergillus MAO-N. The function of these amino terminal
extensions is not known; they are not recognizable as secretion signals. Based on the likely
localization of the Exophiala APAO outside the cell membrane, the prediction is that
k0n0-395 would have a signal sequence similar to that of the fumonisin esterase cloned
from the same organism (US patent no. 5,716,820, supra). Using Genome Walker™, it is
possible to clone the 5' end of the transcript and upstream genomic regulatory elements.
However, the signal sequence is not expected to be critical to the functionality of the
enzyme; in fact, the preferred strategy for heterologous expression in maize and Pichia
pastoris involves replacing the endogenous signal sequence (if present) with an optimized
signal sequence for the organism, e.g. barley alpha amylase for maize and the yeast alpha
factor secretion signal for Pichia. In maize transformed with fumonisin esterase, the
barley alpha amylase signal sequence gave higher amounts of functional protein than the
native fungal signal; therefore replacement of the native fungal signal sequence is a logical
optimization step. Since many of the amine oxidases have a positively charged amino acid
near the N-terminus and upstream of the dinucleotide binding site, an additional
optimization step included adding a codon for lysine (K) to the N-terminus of the
trAPAO clone (k0n0-395_6.5, SEQ ID NO: 5). This clone is designated K:trAPAO and
can be seen in SEQ ID NOS: 10 and 11. The extra lysine is at amino acid 1 and
nucleotides 1-3.



EXAMPLE 8 

Pichia Expression of trAPAO

For optimum expression of trAPAO in Pichia pastoris the alpha mating factor
signal peptide was operably linked in-frame with the K:trAPAO coding sequence and can be
seen in SEQ ID NOS: 16 and 17. The nucleotide sequence of clone
pPicZalphaA:K:trAPAO contains a PCR-amplified insert comprising the k0n0-395 open
reading frame with an additional lysine residue at the amino terminus, with a 5' EcoRI site
and 3' NotI site for in-frame cloning into the alpha factor secretion vector pPicZalphaA.
Nucleotides 1-267 contain the yeast alpha mating factor secretion signal. The amino acid
sequence, shown in SEQ ID NO: 17, contains the trAPAO polypeptide produced from
pPicZalphaA:K:trAPAO following transformation into Pichia pastoris.

For cloning into expression vectors, two cloning strategies were used. The cDNA
k0n0-395_5.4 was generated by end-to-end PCR using distal 5' and 3' GSPs with
the adapter-ligated double stranded cDNA as a template. Each oligonucleotide primer was
designed with a 5' restriction enzyme site followed by 23-25 bp of anchored gene
sequence. The 3' primer also included the stop codon. The primer sequences are N23256:
5'-ggggaattcAAAGACAACGTTGCGGACGTGGTAG-3' (SEQ ID NO: 12) and N23259:
5'-ggggcggccgcCTATGCTGCTGGCACCAGGCTAG-3' (SEQ ID NO: 13). A second
method was used to generate k0n0-395_6.5. 5' RACE and 3' RACE products were generated
using a distal primer containing the necessary restriction enzyme sites, stop codon, etc.,
as described above, paired with a "medial" GSP. The "medial primers" were N21965:
5'-TGGTTTCGTTACCGACAACCTTGTATCCC-3' (SEQ ID NO: 14) for 3' RACE and, for
5' RACE, the oligonucleotide N21968: 5'-GAGTTGGTCCCAGACAGACTTTTGTCGT-3'
(SEQ ID NO: 15). Adapter-ligated double stranded cDNA was used as template. The
isolated 5' and 3' RACE fragments were digested with a restriction enzyme that cuts
uniquely in the region of overlap, in this case Bgl I, isolated and ligated into the expression
vector. The digestible restriction sites allow cloning of the inserts in-frame into EcoRI/NotI
digested pPicZalphaA. pPicZalphaA is an E. coli compatible Pichia expression vector
containing a functional yeast alpha factor secretion signal and peptide processing sites,
allowing high efficiency, inducible secretion into the culture medium of Pichia. The
resulting 1.4 kb bands were cloned into EcoRI/NotI digested pPicZalphaA plasmid.
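
The second strategy depends on a restriction site that occurs once in each RACE fragment and lies within their shared overlap. A minimal sketch of such a check follows. It assumes the Bgl I recognition sequence GCCNNNNNGGC from standard enzyme references; the function names and the toy fragments are hypothetical and are not the actual RACE products.

```python
import re

# Bgl I recognition sequence GCCNNNNNGGC (N = any base). The lookahead lets
# overlapping occurrences be reported as separate sites.
BGL_I_SITE = re.compile(r"(?=(GCC[ACGT]{5}GGC))")
SITE_LENGTH = 11


def find_sites(seq, pattern=BGL_I_SITE):
    """Return 0-based start positions of every site in `seq`."""
    return [match.start() for match in pattern.finditer(seq.upper())]


def unique_site_in_overlap(fragment_5, fragment_3, overlap_len):
    """True if each fragment has exactly one site and it lies in the shared overlap.

    The overlap is assumed to be the last `overlap_len` bases of `fragment_5`
    and the first `overlap_len` bases of `fragment_3`.
    """
    sites_5 = find_sites(fragment_5)
    sites_3 = find_sites(fragment_3)
    in_5 = len(sites_5) == 1 and sites_5[0] >= len(fragment_5) - overlap_len
    in_3 = len(sites_3) == 1 and sites_3[0] + SITE_LENGTH <= overlap_len
    return in_5 and in_3


if __name__ == "__main__":
    # Toy fragments sharing a 15-base overlap that contains one Bgl I site.
    toy_overlap = "ATGCCAAAAAGGCTA"
    toy_5_race = "TTTTTTTTTT" + toy_overlap
    toy_3_race = toy_overlap + "TTTTTTTTTT"
    print(find_sites(toy_overlap))
    print(unique_site_in_overlap(toy_5_race, toy_3_race, len(toy_overlap)))
```

Requiring exactly one site per fragment mirrors the stated condition that the enzyme cuts uniquely, so that digestion yields only the two pieces to be ligated.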

SEQ ID NO: 16 contains the polynucleotide sequence of clone
pPicZalphaA:K:trAPAO, a PCR-amplified insert that comprises the k0n0-395 open
reading frame with an additional lysine residue at the amino terminus, and a 5' EcoRI site
and 3' NotI site for in-frame cloning into the alpha factor secretion vector pPicZalphaA.
SEQ ID NO: 17 contains the amino acid sequence of the trAPAO polypeptide produced
from pPicZalphaA:K:trAPAO following transformation into Pichia pastoris. The alpha
factor secretion signal and a lysine are added.

Pichia was transformed as described in Invitrogen Manual, Easy Select™ Pichia
Expression Kit, Version B, #161219, with the trAPAO polynucleotide as described above
with either an intron (trAPAO-I, negative control, no expression of active trAPAO since
Pichia does not splice introns very efficiently) or without an intron (capable of making an
active APAO protein). The Pichia culture fluids and pellets were assayed for APAO
activity as described earlier.

The set of frozen six day Pichia culture cell pellets contained two samples with
intron (SEQ ID NO: 7) in the gene construct, #11 and #14, and two samples without intron
in the gene construct (SEQ ID NO: 5), #6 and #52. The six day culture fluids from the same
cultures were spiked with crude fungal enzyme for positive controls.

The 50 µl cell pellets were resuspended in 150 µl cold 50 mM Na-phosphate, pH
8.0, and divided into two fresh 500 µl tubes. One tube was kept on ice with no treatment
(the pellet suspension), and one tube was used for lysis. An equal volume of 0.1 mm
zirconia-silica beads was added to each tube. The tubes were BeadBeat™ for 15 seconds
then cooled on ice for 5 minutes. This was repeated three times. The crude lysate was then
transferred to another tube for assay (the lysate suspension).

The TLC assays were performed as follows. The samples were 1) pellet suspensions,
10 µl; 2) lysate suspensions, 10 µl; 3) media controls (5 µl media mixed with 5 µl crude
fungal enzyme), 10 µl; 4) positive control (crude fungal enzyme, undiluted), 10 µl; 5)
substrate control (50 mM Na-phosphate, pH 8.0), 10 µl. Ten microliters of each sample
plus 10 µl of 14C-AP1 (1 mg/ml, 50 mM Na-phosphate, pH 8) was incubated at room
temperature for 6 days. One microliter of the sample was spotted onto C18 and C60 TLC
plates. The C18 plates were developed in MeOH:4% KCl (3:2). The C60 plates were
developed in CHCl3:MeOH:CH3COOH:H2O (55:36:8:1). The plates were then air dried
and exposed to a PhosphorScreen™ for 2-3 days. A Storm™ PhosphorImager was
used to develop the images.

A positive TLC result is obtained if an additional radioactive spot appears at a
lower Rf, corresponding to the AP1 modification product earlier identified as 2-OP, a
deaminated product of AP1. In samples #6 and #52 (without intron) the AP1-modifying
enzyme activity (conversion of AP1 to 2-OP) was detected in pellet suspensions and pellet
lysates, although the majority of activity was associated with the pellet suspensions. In
samples #11 and #14 (with intron) a minimal amount of AP1-modifying enzyme activity was
detectable in the pellet lysate of #14 only, which indicates Pichia cannot process the
intron efficiently.

This experiment verified that APAO activity can be detected in Pichia transformants,
which verifies that trAPAO as described functions correctly in degrading AP1. The
activity is associated with cell suspensions, which show higher activity than pellet lysates.
Pellet lysates may show less activity due to release of endogenous proteases during lysis of
the cells.

EXAMPLE 9
Expression of trAPAO or APAO in E. coli

The vector for expressing K:trAPAO in E. coli is pGEX-4T-1. This vector is a
prokaryotic glutathione S-transferase (GST) fusion vector for inducible, high-level
intracellular expression of genes or gene fragments as fusions with Schistosoma japonicum
GST. GST gene fusion vectors include the following features: a lac promoter for inducible,
high-level expression; an internal lac Iq gene for use in any E. coli host; and the thrombin,
factor Xa or PreScission Protease recognition sites for cleaving the desired protein from
the fusion product. The insert of interest, k0n0-395_6.5 (K:trAPAO) or APAO, was
subcloned into the 5' EcoRI site and a 3' NotI site, allowing in-frame expression of the
GST:K:trAPAO or GST:APAO fusion peptide.

The polynucleotide sequence of the GST:K:trAPAO fusion can be found in SEQ 
ID NO: 18. The GST fusion with polylinker can be found at nucleotides 1 to 687. The 
K:trAPAO can be found at nucleotides 688 to 2076. The resulting polypeptide for the 
GST:K:trAPAO fusion can be seen at SEQ ID NO: 19. Amino acids 1 to 229 represent the
GST fusion plus polylinker and amino acids 230 to 692 represent the K:trAPAO portion of 
the fusion. 
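
The nucleotide and amino acid coordinates quoted for the fusion can be cross-checked by converting in-frame nucleotide ranges to codon numbers. The helper below is a small illustrative sketch; `nt_to_aa_range` is a hypothetical name, and the coordinates are 1-based and inclusive as in the text.

```python
def nt_to_aa_range(nt_start, nt_end):
    """Convert a 1-based, inclusive, in-frame nucleotide range to codon numbers.

    Raises ValueError if the range does not begin and end on codon boundaries
    relative to nucleotide 1 of the reading frame.
    """
    if nt_start < 1 or nt_end < nt_start:
        raise ValueError("invalid nucleotide range")
    if (nt_start - 1) % 3 != 0 or nt_end % 3 != 0:
        raise ValueError("range does not start and end on codon boundaries")
    return ((nt_start - 1) // 3 + 1, nt_end // 3)


if __name__ == "__main__":
    print(nt_to_aa_range(1, 687))     # GST plus polylinker
    print(nt_to_aa_range(688, 2076))  # K:trAPAO portion
```

Applied to nucleotides 1 to 687 and 688 to 2076, the conversion reproduces the amino acid ranges 1 to 229 and 230 to 692 stated for the fusion polypeptide.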

E. coli was transformed with the pGEX-4T-1 vector containing K:trAPAO or
APAO as described in BRL catalogue, Life Technologies, Inc. catalogue; Hanahan, D., J.
Mol Biol 166:557 (1983); Jessee, J., Focus 6:4 (1984); King, P.V. and Blakesley, R., Focus
8:1, 1 (1986), and hereby incorporated by reference. The transformed E. coli was induced
by addition of IPTG (isopropyl β-D-thiogalactopyranoside). Four samples of soluble
extract and four samples of insoluble inclusion bodies were tested for trAPAO or APAO
activity as described in Example 8. APAO activity was present in all soluble samples and
two insoluble samples. Highest activity was found at 10 µM IPTG induction. Thus the
pGEX-4T-1 vector containing K:trAPAO or APAO is capable of producing active APAO
enzyme in E. coli.

EXAMPLE 10
The Complete Nucleotide Sequence of the Exophiala APAO Gene

Using Genome Walker, the complete nucleotide sequence of the Exophiala APAO
gene was recovered. The nucleotide sequence described in SEQ ID NO: 5 is missing a
portion of the 5' end of the native gene. The missing portion of the 5' end of the native
gene is not necessary for expression of an active APAO enzyme, as can be seen in
Examples 8 and 9. The complete nucleotide sequence of APAO can be seen in SEQ ID
NO: 22. The translation of SEQ ID NO: 22 can be found in SEQ ID NO: 23.

EXAMPLE 11 
Expression of APAO and ESP1 in transgenic maize callus 

One of the preferred constructs for expression in maize is the nucleotide sequence of
the trAPAO operably linked to the barley alpha amylase signal sequence. The nucleotide
sequence of the K:trAPAO translational fusion with the barley alpha amylase signal
sequence, for expression and secretion of the mature trAPAO in maize, can be seen in SEQ
ID NO: 20. Nucleotides 1-72 represent the barley alpha amylase signal sequence;
nucleotides 73-75 represent the added lysine residue; and nucleotides 76-1464 represent
the trAPAO cDNA. The amino acid sequence translation of SEQ ID NO: 20 can be found in
SEQ ID NO: 21. Amino acids 1 to 24 represent the barley alpha amylase signal sequence and
amino acids 25 to 463 are the sequence of K:trAPAO.

Maize embryos were transformed with linear DNA (insert, lacking a bacterial
antibiotic resistance marker), derived from constructs containing three transcription units:
1) a PAT selectable marker gene (Wohlleben et al., Gene 70, 25-37 (1988)), 2) fumonisin
esterase ESP1 operably linked to a barley alpha amylase signal sequence, and 3) full length
APAO without or with an amino-terminal barley alpha amylase signal sequence (P13603,
comprising a PAT selectable marker operably linked to a 35S promoter, fumonisin esterase
ESP1 operably linked to a barley alpha amylase signal sequence and the ubiquitin
promoter, and APAO operably linked to the ubiquitin promoter; and P13611, comprising a
PAT selectable marker operably linked to the 35S promoter, fumonisin esterase ESP1
operably linked to a barley alpha amylase signal sequence and the ubiquitin promoter, and
APAO operably linked to a barley alpha amylase signal sequence and the ubiquitin
promoter). In these constructs both ESP1 and APAO were linked to the maize ubiquitin
promoter and first intron. In a third construct, the same three transcriptional units were
cloned into an Agrobacterium Ti vector (P15258, the construct comprises a PAT
selectable marker, fumonisin esterase ESP1 operably linked to a barley alpha amylase
signal sequence and APAO). Stably transformed callus or T0 plants regenerated from
callus were tested for ESP1 and APAO activity in buffer extracts of leaf tissue, using
radiolabeled FB1 and/or AP1 and C18 thin-layer chromatography. Positive controls
consist of non-transformed tissue spiked with E. coli-expressed recombinant ESP1 or
APAO. The results indicate that both ESP1 and APAO activities can be detected in
transgenic maize callus and plants.



Expression of ESP1 and APAO in transgenic callus

Construct   Sample ID Number   ESP1 activity (TLC)   APAO activity (TLC)
13603       3065.031-2                               +
13603       3065 034-3         +                     +
11603       3065 1117-3                              +
13603       3065.1 ls7-nl 3    +                     +
13603       3065.117-2         +                     +
13603       3065.1115-2        +                     +
13603       3065.1115-6        +                     +
13603       3065.1112-1        +                     +
13603       3065.118-6         +                     +
13603       3065.11s3-l        +                     +
13603       3065.11sl-13       +                     +
13603       2805.762-2         +                     +
13603       3065.1110-2        +                     +
13603       3065.039-2                               +
13611       3065.293-3         +                     +
13611       3065.263-1         +                     +
13611       3070.24.2.3                              +



Transgenic plants were regenerated from the transgenic callus positive for both
ESP1 and APAO activity by standard methods known in the art. Enzyme activity was
tested as described previously. As can be seen below, transgenic maize plants can
successfully express both ESP1 and APAO enzymes.



Expression of APAO and ESP1 in transgenic maize plants (T0)

Construct   Sample ID Number   ESP1 activity (TLC)   APAO activity (TLC)
13603       910080             +                     +
13603       910081             +                     +
13603       917065             +                     +


Another preferred construct for expression of APAO in a plant is targeting the
APAO to the peroxisome. Maize embryos were bombarded with insert containing APAO
operably linked to the ubiquitin promoter and a peroxisomal targeting sequence (Gould, et
al., J Cell Biol 108:1657-1664 (1989)); ESP1 operably linked to the ubiquitin promoter and
the barley alpha amylase signal sequence; and a selectable marker of PAT operably linked to
the 35S promoter (construct number 114952). Negative controls were unbombarded
embryos/callus. Positive controls were unbombarded embryos/callus spiked with purified
enzyme. Transformed callus was then tested for ESP1 or APAO activity as previously
described. Out of 67 samples tested, 18 samples contained both ESP1 activity and APAO
activity. Peroxisomally targeted APAO and apoplast targeted fumonisin esterase can both
be successfully expressed in a plant cell.

Another preferred construct for expression of APAO in a plant is targeting the
APAO to the mitochondrial membrane. A C-terminal extension is required for targeting
monoamine oxidases MAO-A and MAO-B to mammalian outer mitochondrial
membranes. A MAO-A, MAO-B, or functionally similar C-terminal extension can be
ligated in-frame to APAO or trAPAO to facilitate localization of this enzyme to the
mitochondrial membrane of maize or other transformed species.

EXAMPLE 12 
Comparison of APAO Sequence With Other Sequences 

The Exophiala cDNA APAO (SEQ ID NO: 22) contains an 1800 bp open reading
frame coding for a 600 amino acid polypeptide (SEQ ID NO: 23) with divergent homology
to two classes of proteins. The carboxy three-fourths of APAO (amino acids 137 to 593) is
strongly homologous to flavin amine oxidases, a group of enzymes catalyzing the
oxidative deamination of primary amines at carbon 1. The amine oxidase function of the
carboxy terminal domain was confirmed by expression of a truncated APAO polypeptide
(from 137 to 600) in both Pichia pastoris and E. coli, using AP1 as a substrate (see
Example 9). The amino terminal portion of APAO, in contrast, (from approx. 5 to 134)
shows significant homology to a group of small deduced open reading frames (ORFs)
reported in several bacteria and blue-green algae, as well as several higher organisms.
These ORFs code for small proteins of unknown function, ranging in size from 14 to 17
kDa. The juxtaposition of these divergent homologies in a single polypeptide has not been
reported previously.
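
The sizes quoted above can be related with simple arithmetic. The sketch below is a rough check only: it assumes an average residue mass of about 110 Da, a common rule of thumb that is not taken from this text, and the helper names are hypothetical.

```python
# Rule-of-thumb average mass of one amino acid residue in a protein (assumption).
AVERAGE_RESIDUE_MASS_DA = 110.0


def codon_count(orf_length_bp):
    """Number of codons in an open reading frame of the given length."""
    if orf_length_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return orf_length_bp // 3


def approximate_mass_kda(residue_count):
    """Approximate polypeptide mass in kDa from its residue count."""
    return residue_count * AVERAGE_RESIDUE_MASS_DA / 1000.0


if __name__ == "__main__":
    print("codons in an 1800 bp ORF:", codon_count(1800))
    amino_terminal_residues = 134 - 5 + 1  # residues 5 to 134, inclusive
    print("amino terminal portion, residues:", amino_terminal_residues)
    print("approximate mass, kDa:", approximate_mass_kda(amino_terminal_residues))
```

On this estimate the amino terminal portion of roughly 130 residues falls at about 14 kDa, in line with the 14 to 17 kDa size range given for the homologous small ORF products.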
Flavin amine oxidases (E.C. 1.4.3.4) are a group of flavoenzymes found in both
higher and lower organisms, and serve a variety of functions in catabolism. They catalyze
the oxidative deamination of primary amino groups located at the C-1 position of a variety
of substrates, resulting in an aldehyde product plus ammonia and hydrogen peroxide. The
APAO enzymes of the present invention are the first flavin amine oxidases known to attack
a primary amine not located at C-1 (i.e. C-2 of AP1), resulting in a keto rather than
aldehydic product. However, amino acid oxidases, while not closely related to flavin
amine oxidases, are flavoenzymes that oxidize a C-2 amine adjacent to a C-1 carboxyl
group.

The monoamine oxidases MAO A & B (from human, bovine, and trout) are
localized in the mitochondrial outer membrane of higher organisms and regulate the level
of neurotransmitters. Microbial examples include a fungal amine oxidase (Aspergillus
niger (niger) MAO-N) involved in amine catabolism, and a bacterial putrescine oxidase
from a gram (+) bacterium (Micrococcus rubens). The primary polypeptides vary in
length from 478 to 527 amino acids, and share regions of high amino acid sequence
conservation at the 5' end as well as at various points through the coding region. Protein
alignments generated with PileUp (GCG) indicate that trAPAO contains all conserved
domains found in this class of proteins including those near the 5' end.

The amine oxidase domain of trAPAO contains several key features shared by this
class of enzymes, including an amino-terminal dinucleotide (ADP) binding region
characterized by a beta-alpha-beta stretch containing three invariant glycines
(G-X-G-X-X-G) in the beta-alpha turn. In trAPAO, this sequence is (DWVVGAGLSG). This
region is involved in FAD binding. Absent are several features unique to the mammalian
amine oxidases, including several important cysteine residues (Wu et al., Mol Pharm
43:888 (1993)), one of which (Cys-406 of MAO-A) is involved in covalent binding of
FAD, and a carboxy-terminal extension that has been demonstrated to be involved in
transporting to and anchoring the MAO in the outer mitochondrial membrane. The
Aspergillus enzyme MAO-N has been demonstrated to contain non-covalent FAD, and
also lacks the conserved cysteine. Therefore it is possible that the APAO enzyme has a
non-covalent FAD. The Aspergillus MAO-N has a carboxy-terminal tripeptide Ala-Arg-Leu
that is involved in peroxisomal targeting and localization; this sequence is absent from
Exophiala MAO.
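
The three-glycine pattern described above can be located in a peptide with a simple pattern search. The sketch below is illustrative; the helper name is hypothetical, and the only sequence used is the ten-residue trAPAO stretch quoted in the text.

```python
import re

# G-X-G-X-X-G, where X is any residue.
DINUCLEOTIDE_BINDING_MOTIF = re.compile(r"G.G..G")


def find_gxgxxg(peptide):
    """Return (1-based start, matched residues) for the first motif hit, or None."""
    match = DINUCLEOTIDE_BINDING_MOTIF.search(peptide.upper())
    if match is None:
        return None
    return (match.start() + 1, match.group())


if __name__ == "__main__":
    # The trAPAO stretch quoted in the text.
    print(find_gxgxxg("DWVVGAGLSG"))
```

In the quoted stretch the motif begins at the fifth residue and spans GAGLSG, so the three invariant glycines fall at positions 5, 7 and 10 of that stretch.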
The amine oxidase domain of trAPAO contains a total of seven cysteines,
compared to ten for the Aspergillus enzyme and only two for the Micrococcus enzyme.
The mammalian MAO enzymes contain variable numbers of cysteines (at least ten), some
of which are highly conserved (including the FAD binding residue mentioned above). The
trAPAO sequence also has two putative glycosylation sites (NDS, NQS) towards the
amino end.
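
Putative N-linked glycosylation sites of the kind named above are commonly found by scanning for the tripeptide N-X-S/T in which X is not proline. That consensus is an assumption brought in for illustration, since the text names only the two tripeptides; the helper name and the toy peptide are likewise hypothetical.

```python
import re

# N-X-S/T sequon with X not proline; the lookahead reports overlapping hits.
SEQUON = re.compile(r"(?=(N[^P][ST]))")


def find_sequons(peptide):
    """Return a list of (1-based position of N, tripeptide) for each sequon."""
    return [(match.start() + 1, match.group(1)) for match in SEQUON.finditer(peptide.upper())]


if __name__ == "__main__":
    # Toy peptide for illustration; it is not part of the trAPAO sequence.
    print(find_sequons("MKNDSAANQSPLNPT"))
```

Both NDS and NQS satisfy this pattern, while a tripeptide such as NPT would be excluded by the proline rule.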

The purpose of the amino-terminal extension of APAO and the basis for its
homology to a group of 14-17 kDa proteins is not clear. In Synechocystis, a similar
polypeptide ORF is located immediately upstream of the NADP-dependent glutamate
dehydrogenase (gdhA) and has been shown to be required for functional expression of
gdhA (Chavez et al., 1995). However, in trAPAO the domain is clearly not necessary for
enzymatic activity, as shown by the results of the expression experiments using the
truncated APAO. An interesting clue comes from the frequent association of this small
ORF with gene clusters involved in oxidoreductase activity in bacteria, or induced by heat
stress in mice, suggesting a possible role in redox protection. A byproduct of amine
oxidase activity is hydrogen peroxide. Flavoenzymes and other redox enzymes are often
susceptible to inactivation by hydrogen peroxide (Schrader et al., App Microb Biotechnol
45:458; Aguiree, et al., J Bacteriol 171:6243 (1989)), and it is possible that this protein has
a protective role against oxidants such as hydrogen peroxide. Alternatively, this domain
could be involved in enzyme function, localization or association of the enzyme with other
structures. No signal peptide region can be detected in this amino terminal region.

In multiple sequence alignment using GCG PileUp, trAPAO is most similar to
putrescine oxidase of Micrococcus rubens, Swissprot accession number P40974 (30%
identical amino acids, 40% similar). Homology with several mammalian monoamine
oxidases A and B, Swissprot accession numbers P21397 (Homo sapiens mao a), P19643
(Rattus norvegicus mao b), P21396 (Rattus norvegicus mao a), and P21398 (Bos taurus
mao a), is somewhat less, ranging from 25 to 28% identity and 36 to 40% similarity.
Homology to the only other fungal flavin amine oxidase known, MAO-N from Aspergillus
niger (Swissprot accession number P46882), is somewhat lower (24% identical, 34%
similar). The microbial enzymes are considerably divergent from each other, while the
mammalian monoamine oxidases share 65 to 87% identity.
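
Percent identity and similarity figures of this kind are derived from aligned sequences. The sketch below shows one simple way to compute them from a pairwise alignment; it is not the GCG procedure. The conservative-substitution groups, the choice to ignore gapped columns, and the toy alignment are all assumptions made for illustration.

```python
# Illustrative groups of residues treated as similar (an assumption; real
# programs typically use a scoring matrix instead).
SIMILAR_GROUPS = [set("ILVM"), set("FYW"), set("KRH"), set("DE"), set("NQ"), set("ST"), set("AG")]


def residues_similar(a, b):
    """True if two residues are identical or fall in the same similarity group."""
    return a == b or any(a in group and b in group for group in SIMILAR_GROUPS)


def identity_and_similarity(aligned_1, aligned_2, gap="-"):
    """Return (percent identity, percent similarity) over ungapped alignment columns."""
    if len(aligned_1) != len(aligned_2):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = similar = 0
    for a, b in zip(aligned_1.upper(), aligned_2.upper()):
        if a == gap or b == gap:
            continue  # columns with a gap are not counted
        compared += 1
        if a == b:
            identical += 1
            similar += 1
        elif residues_similar(a, b):
            similar += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return (100.0 * identical / compared, 100.0 * similar / compared)


if __name__ == "__main__":
    # Toy alignment for illustration only.
    print(identity_and_similarity("MKV-LDEAGS", "MRVALDQ-GT"))
```

Because programs differ in how they treat gaps and which substitutions count as similar, values computed this way would not be expected to match the reported percentages exactly.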
The amino terminal domain (ATD) of APAO also shows homology to a 14.5 kD
protein from human and rat phagocytes that shows translational inhibition activity in vitro
(Swissprot accession # P52758, P52759; Schmiedeknecht, et al., Eur J Biochem 242 (2),
339-351 (1996)), and includes a heat-responsive protein from mouse (Samuel, et al.,
Hepatology 25 (5), 1213-1222 (1997)). This suggests that this family of proteins is
involved in regulating cellular metabolism. No example exists in which this domain is
fused to a larger protein domain, however, making APAO unique. Without intending to be
limited by theory, all of this suggests that this domain plays a regulatory role in APAO
gene expression, possibly to prevent translation of the message when it is not needed. This
raises the question of how translation of the message is restored when active enzyme is
required by the Exophiala cell. Possibly there are alternative start sites that begin
downstream of the inhibitor domain, or the inhibitor domain undergoes proteolysis,
complexing, degradation, or phosphorylation/dephosphorylation when it is not needed. The
first possibility is less likely because there are no other ATG codons prior to the ATG at
122-124 that constitutes the predicted start site of APAO. The second possibility cannot be
easily tested, although there is a casein kinase site in the ATD. Alternative roles for the
ATD include oligomerization of the APAO protein, or anchoring the protein to some
intracellular site, such as the membrane.

A parallel example of regulatory control over another flavoenzyme, human flavin
monooxygenase 4 (FMO-4), by a C-terminal extension has been reported (Itagaki, et al.,
J of Biol Chem 271(33): 20102-20107 (1996)). In this case the introduction of a stop codon
prior to the 81 base C-terminal extension allowed expression of active enzyme in
heterologous systems. The role of the C-terminal portion was not elucidated, however. In
another example, alternative splicing led to a shorter gene product that complexed with and
interfered with the function of the normally spliced version (Quinet, et al., J of Biol Chem
268(23): 16891-16894 (1993)). In another case, an alternative splicing-generated insert in
another protein led to inhibition of cell growth (Bhat, et al., Protein Engineering 9(8):
713-718 (1996)). In yet another variation, Fas/Apo1 splicing variants prevent apoptosis,
apparently through a 49 amino acid domain shared by all variants (Papoff, et al., J of
Immunology 156(12): 4622-4630 (1996)).

EXAMPLE 13

Making a Chimera Protein Containing Fumonisin Esterase and APAO Activity in the Same
Polypeptide.

The enzyme activities of fumonisin esterase and APAO can be combined in a
single polypeptide by using the open reading frames together either with or without a
spacer region between the two polypeptides. This creates a hybrid protein with dual
enzyme activities that can be exported as a unit to the apoplast, and will allow both enzyme
activities to be conveniently localized to the same area of the cell wall. The two cDNAs
can be combined in either order, but the preferred method is to link them in the order
NH3-Esterase:APAO-COOH. The spacer, if present, may consist of a short stretch of amino
acids such as GGGSGGGS, or a set of amino acids that comprises a protease cleavage site
that can be acted on by an apoplastic protease. This would result in the production of
stoichiometric amounts of both esterase and APAO enzymes in the apoplast.
Alternatively, a polycistronic message could be engineered which is capable of direct
translation of a downstream sequence, for example by inclusion of an IRES sequence in the
spacer region or a polynucleotide spacer region containing a polynucleotide cleavage site
that can be recognized by RNAse or is a self-cleaving ribozyme. The length of the splice
site could be of any length that ensures proper translation of the polynucleotide.
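
The in-frame joining of two open reading frames through a GGGSGGGS spacer can be illustrated with a short sketch. It is illustrative only: the function name `fuse_orfs`, the linker codon choices (GGT for glycine, TCT for serine), and the toy input sequences are assumptions made for the example and are not taken from the constructs of SEQ ID NOS: 24 to 31.

```python
# Hypothetical codon choices for the spacer; a real construct would use
# codons preferred by the expression host.
LINKER_CODONS = {"G": "GGT", "S": "TCT"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_orfs(upstream_orf, downstream_orf, linker_aa="GGGSGGGS"):
    """Join two ORFs in frame, upstream first, with an optional peptide spacer."""
    up = upstream_orf.upper()
    down = downstream_orf.upper()
    for name, orf in (("upstream", up), ("downstream", down)):
        if len(orf) % 3 != 0:
            raise ValueError(f"{name} ORF length is not a multiple of 3")
    # Remove the upstream stop codon so translation continues into the spacer.
    if up[-3:] in STOP_CODONS:
        up = up[:-3]
    linker = "".join(LINKER_CODONS[aa] for aa in linker_aa)
    return up + linker + down


# Toy ORFs (not real esterase or APAO sequence): Met-Ala-stop and Met-Lys-stop.
fusion = fuse_orfs("ATGGCTTAA", "ATGAAATGA")
print(fusion, len(fusion) % 3 == 0)
```

Passing an empty `linker_aa` gives the spacer-free fusion; a protease cleavage site would be handled the same way by extending the codon table.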

The esterase-APAO ligated protein can be made with any fumonisin esterase,
including but not limited to, the fumonisin esterase from E. spinifera (ESP1) or fumonisin
esterase from bacterium (BEST1). Since the pH range for maximum activity of BEST1 is
similar to that of APAO (range 6.0 to 8.0), these may present the most effective
combination in fusion form. In addition, any of the polynucleotides of the present
invention, including APAO mutated to improve expression, may be used for an
esterase-APAO ligation. As described in previous examples these fusion sequences can be
placed in the appropriate expression vectors and used to express proteins in either bacteria
or plants.

The nucleotide sequence of ESP1 contains three nucleotide differences and three
corresponding amino acid differences from the ESP1 sequence disclosed in pending US
application no. 08/888,950, filed July 7, 1997 and US patent no. 6,025,188, issued
February 15, 2000. Both the sequences disclosed in the present application and the
sequences disclosed in the pending US applications contain functional fumonisin esterase
genes. For the purposes of the present invention, either the original ESP1 sequences or the
ESP1 sequences disclosed herein may be used in combination with the APAO sequences or
in fusion sequences. The nucleotide sequence of a BAA:ESP1 rtrAPAO construct for plant
expression can be found in SEQ ID NO: 24 and the translation in SEQ ID NO: 25. The
nucleotide sequence for a BAA:BEST1:K:trAPAO construct for plant expression can be
found in SEQ ID NO: 26 and the translation in SEQ ID NO: 27. The nucleotide sequence
of a GST:ESP1:K:trAPAO fusion for bacterial expression in a pGEX-4T-1 or similar
vector can be found in SEQ ID NO: 28 and the translation in SEQ ID NO: 29. The
nucleotide sequence for a GST:BEST1:K:trAPAO fusion for bacterial expression in a
pGEX-4T-1 or similar vector can be seen in SEQ ID NO: 30 and the translation in SEQ ID
NO: 31.



EXAMPLE 14

APAO Substrate Studies

The following assay was used to determine the substrate specificity of the APAO
enzyme. Reaction mix: 436 µl of 200 mM Na-phosphate, pH 8.0; 50 µl substrate (10 mM);
2 µl Amplex Red (1 mg in 200 µl DMSO); and 2 µl of Peroxidase (5000 U/ml). The
APAO enzyme was recombinant enzyme produced as GST fusion in E. coli, purified over
a glutathione affinity column and cleaved with thrombin to remove the GST. All
components were mixed at room temperature. The initial rate was determined in a
spectrophotometer at 572 nm over one minute in absorbance units/second (BLANK). Ten
microliters of APAO at 70 µg/ml was added and mixed. The initial rate was again
determined at 572 nm over one minute in absorbance units/second (SAMPLE). The rates
were converted to absorbance units/minute. The BLANK value was subtracted from the
SAMPLE value. The absorbance units were converted to µM H2O2 wherein 1 µM H2O2
equals 0.138 absorbance units at pH 8.0.
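
The conversion described above can be written out as a short calculation. This is a sketch under the stated conversion factor only; the function name and the example rates are hypothetical and are not measured values from this example.

```python
# From the text: 1 µM H2O2 corresponds to 0.138 absorbance units at pH 8.0.
ABS_PER_UM_H2O2 = 0.138


def h2o2_rate_um_per_min(blank_abs_per_s, sample_abs_per_s):
    """Convert BLANK and SAMPLE rates (absorbance units/second) to µM H2O2/min."""
    # Subtract the blank, then convert seconds to minutes.
    net_abs_per_min = (sample_abs_per_s - blank_abs_per_s) * 60.0
    return net_abs_per_min / ABS_PER_UM_H2O2


# Hypothetical readings: a net rate of 0.0023 absorbance units/second.
rate = h2o2_rate_um_per_min(blank_abs_per_s=0.0001, sample_abs_per_s=0.0024)
print(f"{rate:.3f} µM H2O2/min")
```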

SUBSTRATES FOR APAO

SUBSTRATE                    RATE (µM H2O2/min)
1 mM Fumonisin B1            0.1429
1 mM AP1                     0.8876
0.5 mg/mL Fumonisin B2       0.3058
1 mM Fumonisin B3            0.1449
0.5 mg/mL Fumonisin B4       0.1728
1 mM norepinephrine          0.0087
1 mM epinephrine             0.0071
1 mM dopamine                0.0040
1 mM spermine                0.0002


NOT SUBSTRATES FOR APAO (defined as compounds resulting in less than 1%
conversion to hydrogen peroxide by APAO relative to AP1 under similar conditions of
time, pH, temperature, and substrate concentration): 2-phenylethylamine, spermidine,
EDTA-Na2, tryptamine, putrescine, benzamidine, serotonin, cadaverine, Pefabloc SC,
tyramine, 1,3-diaminopropane, leupeptin, histamine, hydroxylamine, aprotinin, deprenyl,
Fumonisin C4, isoniazid, sphingosine, phenelzine, sphinganine, phytosphingosine,
D-alanine, DL-alanine, L-arginine, L-asparagine, L-aspartic acid, D-aspartic acid, L-cysteine,
L-glutamine, L-glutamic acid, glycine, L-histidine, L-isoleucine, L-leucine, L-lysine,
DL-lysine, L-methionine, DL-methionine, L-phenylalanine, L-proline, L-threonine,
L-tryptophan, L-tyrosine, L-valine.

EXAMPLE 15

Sites on APAO for Possible Mutagenesis

Some cytosolic enzymes, when engineered for secretion by fusion with a
heterologous signal peptide, lack function due to glycosylation at one or more potential
glycosylation sites (amino acid consensus sequence N-X-S/T) that are not normally
glycosylated in the native environment (Farrell et al., Plant Mol Biol 15 (6): 821-5 (1990)).
Since APAO lacks a recognizable signal sequence, it may be cytoplasmically localized in
Exophiala spinifera, although secretion by some other method not involving a signal
peptide cannot be ruled out. APAO contains two potential glycosylation sites, which may
be glycosylated when APAO is secreted in a plant or other eukaryotic cell. Other
modifications to APAO can be made to improve its expression in a plant system, including
site-directed mutagenesis to remove selected cysteine residues, which may be detrimental
to proper folding when the protein is secreted into the endomembrane system for delivery
to the apoplast.
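
Potential glycosylation sites of the N-X-S/T type can be located in a polypeptide sequence with a simple pattern search. The sketch below is illustrative: the function name and the toy sequence are hypothetical, and the exclusion of proline at the X position is a commonly applied refinement assumed here, not a rule stated in this specification.

```python
import re

# N, then any residue except proline (assumed refinement), then S or T.
# The lookahead lets adjacent or overlapping sites be reported separately.
SEQUON = re.compile(r"(?=(N[^P][ST]))")


def find_sequons(protein_seq):
    """Return (1-based position of N, three-residue motif) for each N-X-S/T site."""
    seq = protein_seq.upper()
    return [(m.start() + 1, m.group(1)) for m in SEQUON.finditer(seq)]


# Toy sequence with two adjacent sites (NDS, NGS) and one excluded site (NPT).
print(find_sequons("MKNDSNGSAANPT"))
```

Two adjacent sites, as in the toy sequence, mirror the arrangement discussed for APAO, where substituting either the asparagine or the serine/threonine of a site removes that site.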

Knowledge of the 3-dimensional structure of APAO would help to evaluate the
likelihood that particular amino acids could contribute to misfolding, and increase the odds
of making rational changes in the APAO sequence for successful secretion. To this end a
3-dimensional model of APAO was developed based on the crystal structure of a related
amine oxidase from maize, maize polyamine oxidase or MPAO (Binda et al., Structure
7:265-276 (1999)). The model was derived by automated modeling using the program
Modeler (Molecular Simulations, Inc., San Diego, CA) and the resulting 3-D structure
showed excellent fit based on an RMS deviation of 0.68 Å for the backbone coordinates
of the two structures. The 3-D model of APAO based on MPAO is shown in Figures 1 and
2. Some of the possible mutations of APAO, which would result in removal of
glycosylation sites or removal of cysteine residues, can be seen below and in Figure 1.

Table of site-directed mutagenesis vectors and enzyme assay results.

Residue numbering used in the construct names (residue position in APAO or trAPAO):
cysteine residues 1 to 9 are C64, C109, C167, C292, C351, C359, C387, C461, and C482;
glycosylation site 1a,b is N201, S203; glycosylation site 2a,b is N204, S206.

                                                    E. coli expression       Maize expression
                                                    vector, APAO or          vector, APAO or
                                                    trAPAO activity          trAPAO activity
Construct                Amino acid substitution    Plasmid      Act         Plasmid      Act
Wild type APAO-1         none                       PHP13367     +           PHP
Glyc(-) 1a2a APAO        N201A, N204A               PHP16284                 n/a
Glyc(-) 1a2b APAO        N201A, S206A               PHP16285                 n/a
Glyc(-) 1b2a APAO        S203A, N204A               PHP16286                 n/a
Glyc(-) 1b2b APAO        S203A, S206A               PHP16287                 n/a
Glyc(-)2a N204A APAO     N204A                      PHP16589     +/-         n/a
Glyc(-)2b S206A APAO     S206A                      PHP16590     +           PHP16711
Cys(-)#8 trAPAO          C461S                      PHP16737     +           n/a
Cys(-)#6,8 trAPAO        C359S, C461S               PHP16738
Cys(-)#3,6,8 trAPAO      C167S, C359S, C461S        PHP17089     + (1)
Cys(-)#1,2,7 APAO        C64A, C109A, C387A

A = alanine
S = serine
(1) Activity against FB1 equals wild type; activity against AP1 was reduced.

APAO and trAPAO polypeptide sequence, annotated. (SEQ ID NO: 47)

The amino terminal domain is italicized. Cysteines and residues involved in putative
glycosylation sites are underlined. Boxed residues represent amino acids that were
successfully altered without complete loss of activity as E. coli-expressed protein.

MALAPSYINPPNVASPA GYSHVG VGPDGGR YVTIA GQIGQDASG VTDPA YEKQ VA QAFA 
NLRACLAA VGA TSND VTKLNYYIVDYAPSKLTAIGDGLKA TFALDRLPPCTL VP VSALSSP
EYLFEVDA TAL FPG//7TPDNVADV WVGAGLSGLETARKVQ AAGLSgLVLEAMD
RVGGKTLSVQSGPGRTTINDLGAAWINDSgQgEVSRLFERFHLEGELQRTTGNSIH 
QAQDGTTTTAPYGDSLLSEEVASALAELLPVWSQLIEEHSLQDLKASPQAKRLDSV 
SFAHYCEKELNLPAVLGVANQITRALLGVEAHEISMLFLTDYIKSATGLSNIFSDKK 
DGGQYMRCKTGMQSIgHAMSKELVPGSVHLNTPVAEIEQSASGCTVRSASGAVF
RSKKVVVSLPTTLYPTLTFSPPLPAEKQALAENSILGYYSKIWVWDKPWWREQGF 
SGVLQSSgDPISFARDTSIDVDRQWSITCFMVGDPGRKWSQQSKQVRQKSVWDQL 
RAAYENAGAQVPEPANVLEIEWSKQQYFQGAPSAVYGLNDLITLGSALRTPFKSV 
HFVGTETSLVWKGYMEGAIRSGQRGAAEVVASLVPAA 


APAO enzyme activity is maintained when a serine residue at position 206 is
mutated to alanine, eliminating a potential glycosylation site (N204 - S206) close to the
putative substrate binding site. Please see the tables entitled "Table of site-directed
mutagenesis vectors and enzyme assay results" and "Glyc(-) APAO lysates from E. coli."
The polynucleotide sequence of APAO mutated to alter the serine at position 206 to an
alanine (S206A) can be seen in SEQ ID NO: 32. The resulting polypeptide is shown in
SEQ ID NO: 33.



Glyc(-) APAO lysates from E. coli

Sample (lysate)         Substrate  µM H2O2/min   Conclusion
WT APAO                 AP1        1.92          Active (wild type)
                        FB1        0.12          Slightly active (wt)
N204A                   AP1        0.09          Slightly active
                        FB1        0.04          Slightly active
S206A                   AP1        0.85          Partially active
                        FB1        0.07          Slightly active



However, in transient expression assays in maize, expression of S206A resulted in
no detectable enzyme activity. Please see the table above entitled "Table of site-directed
mutagenesis vectors and enzyme assay results." Thus, elimination of this glycosylation
site is not in itself sufficient to have an active protein upon secretion. This could be due to
glycosylation occurring at a second adjacent site (N201 - S203). However, no active
APAO was recovered when either N201 or S203 was mutated along with S206. Please see
the table entitled "Table of site-directed mutagenesis vectors and enzyme assay results."

While not to be limited by theory, the molecule may be inactive because both N201
and S203 are buried within the tertiary structure of APAO, and any modification of side
chains disrupts proper folding or conformation, or FAD binding. This is supported by
predicted solvent accessibility numbers for these residues in the 3-D model based on the
maize amine oxidase. Please see the table below entitled "Solvent accessibility for
cysteine residues of truncated APAO." The elimination of the APAO glycosylation site at
amino acids 204 to 206 is not sufficient to allow APAO to be secreted from the cell and
retain full enzyme activity, but elimination of this site may improve chances for obtaining
a fully active enzyme once the other roadblock(s) to secretability have been resolved. In
other words, elimination of this site may be necessary but not sufficient to produce active
secretable APAO.

APAO also contains nine cysteine residues, which are likely to be unpaired in the
reducing environment of the cytosol but which may crosslink unfavorably upon secretion.
Cysteines are present at residues 64, 109, 167, 292, 351, 359, 387, 461, and 482. The 3-D
model helps predict the relative location of each amino acid in the structure, and whether it
is solvent accessible or buried. Buried residues are more difficult to mutate without
destroying structural integrity.



Solvent accessibility for cysteine residues of truncated APAO

APAO  Position (1) Position (2) Cys # (3)  aa MPAO   -1      0       1       average   Conclusion
Cys   26           167          3          Leu 32    0.675   0.253   0.24    0.389333  maybe partially exposed
Cys   151          292          4          Asn 147   0.069   0.122   0.147   0.112667  buried
Cys   210          351          5          Tyr 211   0.184   0.244   0.03    0.152667  buried
Cys   218          359          6          Thr 219   0.633   0.319   0.447   0.466333  maybe partially exposed
Cys   246          387          7          Val 247   0.145   0.046   0.366   0.185667  buried
Cys   320          461          8          Ser 324   0.199   0.789   0.643   0.543667  exposed
Cys   341          482          9          Leu 346   0.152   0.071   0.052   0.091667  buried

1. Relative to amino acid 1 of truncated APAO
2. Relative to amino acid 1 of full length APAO
3. Cysteine number relative to full length APAO
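
The averages in the table above, and the ranking of cysteines by predicted exposure, can be reproduced with a short calculation. This is a sketch only: the values are transcribed from the table, the columns headed -1, 0, and 1 are assumed here to be accessibility values at the aligned position and its two neighbors, and the function names are hypothetical.

```python
# Full-length APAO cysteine position -> accessibility values from the
# "-1", "0", "1" columns of the table (interpretation of columns assumed).
ACCESSIBILITY = {
    167: (0.675, 0.253, 0.240),
    292: (0.069, 0.122, 0.147),
    351: (0.184, 0.244, 0.030),
    359: (0.633, 0.319, 0.447),
    387: (0.145, 0.046, 0.366),
    461: (0.199, 0.789, 0.643),
    482: (0.152, 0.071, 0.052),
}


def mean_accessibility(values):
    """Average of the accessibility values for one residue."""
    return sum(values) / len(values)


def most_exposed(table, n=3):
    """Return the n positions with the highest average accessibility."""
    ranked = sorted(table, key=lambda pos: mean_accessibility(table[pos]), reverse=True)
    return ranked[:n]


for pos in most_exposed(ACCESSIBILITY):
    print(pos, round(mean_accessibility(ACCESSIBILITY[pos]), 6))
```

Ranked this way, positions 461, 359, and 167 have the three highest averages, which correspond to the C461S, C359S, and C167S substitutions listed in the site-directed mutagenesis table.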

Proteins that are secreted to the apoplast are folded to their mature form in the
highly oxidizing environment of the ER/Golgi. Among other things this promotes
crosslinking of cysteine residues often found in secreted proteins. Unpaired cysteines that
are solvent-accessible are rare in secreted proteins, since they would rapidly be oxidized by
other cysteine residues of the same protein or another protein. Although not to be limited
by theory, it is possible that APAO is normally a cytosolic protein, and thus the presence
of nine cysteine residues would not be unusual even though they may not be crosslinked in
the mature protein. In fact, the 3-D model predicts that they would not be crosslinked
because the intermolecular distances predicted would be too great. Therefore it is possible
that secretion of APAO to the apoplast results in improper folding and crosslinking of
cysteines in the Golgi, and results in inactive enzyme. Using the solvent accessibility
tables from APAO modeled against MPAO, the three most solvent-exposed cysteines were
identified and then eliminated by site-directed mutagenesis of the APAO cDNA. The
sequence of APAO mutated at cysteine 461 and used for expression in bacteria can be seen
in SEQ ID NO: 48. The resulting protein is shown in SEQ ID NO: 49. The polynucleotide
and resulting polypeptide sequence of APAO mutated at both cysteines 359 and 461 and
used in the bacterial expression system can be seen in SEQ ID NOS: 50 and 51. The
polynucleotide and resulting polypeptide sequence of APAO mutated at cysteines 167,
359, and 461 can be seen in SEQ ID NOS: 52 and 53.

The APAO molecules mutated at specific cysteines were tested in a bacterial
expression system using the previously described Amplex Red assay. The results can be
seen in the table below entitled "Cys(-) APAO lysates from E. coli." The mutated APAO
molecules can then be tested in maize, linked to a signal peptide, as previously described.
Either one of the cysteines or two or three together could be mutated to serines without any
measured loss in APAO enzyme activity of the E. coli-expressed enzyme. In fact, one of
the E. coli-expressed clones (C359S + C461S; PHP16738) had more APAO activity in
crude lysates than wild type enzyme and may represent a catalytic improvement. A triply
Cys-mutated version of APAO does not show catalytic improvement but retains full
activity of the wild type enzyme against FB1, although AP1 activity was somewhat
reduced. The mutated versions of APAO operably linked to a signal sequence, which retain
function when expressed as recombinant fusion proteins in E. coli, may also provide
additional stability or foldability when expressed in plants or other secretion expression
systems.

Cys(-) APAO lysates from E. coli

Sample (lysate)         Substrate  µM H2O2/min   Conclusion
WT APAO                 AP1        2.14          Active (wild type)
                        FB1        0.11          Slightly active (wt)
C461S                   AP1        2.25          Fully active
                        FB1        0.14          Slightly active
C359S, C461S            AP1        3.90          Fully/more active
                        FB1        0.16          Slightly active
C167S, C359S, C461S     AP1        0.27          Slightly active
                        FB1        0.25          Slightly active

Triple Cys(-) APAO lysates from E. coli

Sample (lysate)         Substrate  µM H2O2/min   Conclusion
WT APAO                 AP1        1.16          Active (wild type)
                        FB1        0.27          Slightly active (wt)
C167S, C359S, C461S     AP1        0.27          Slightly active
                        FB1        0.26          Slightly active
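
The comparison of mutant and wild type lysates can be expressed as a ratio of rates. The sketch below uses the rates reported in the "Cys(-) APAO lysates from E. coli" table; the function name is hypothetical, and because these are crude lysates the ratios are not normalized for the amount of APAO protein expressed.

```python
# Rates in µM H2O2/min, transcribed from the Cys(-) lysate table.
RATES = {
    "WT": {"AP1": 2.14, "FB1": 0.11},
    "C461S": {"AP1": 2.25, "FB1": 0.14},
    "C359S+C461S": {"AP1": 3.90, "FB1": 0.16},
}


def relative_activity(rates, sample, substrate, reference="WT"):
    """Rate of a sample divided by the reference rate on the same substrate."""
    return rates[sample][substrate] / rates[reference][substrate]


for sample in ("C461S", "C359S+C461S"):
    ratio = relative_activity(RATES, sample, "AP1")
    print(f"{sample}: {ratio:.2f} x wild type on AP1")
```

On these figures the double mutant shows roughly 1.8 times the wild type rate on AP1, consistent with the statement that this clone had more APAO activity in crude lysates than the wild type enzyme.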



It is expected that the S206A mutation will contribute to the functionality of
secreted APAO by reducing the degree of glycosylation and the C167S, C359S, and
C461S mutations (or combinations thereof) will improve the functionality of secreted
APAO by reducing chances for spurious disulfide formation on folding.

To determine expression of a mutated APAO in maize, three APAO constructs
were introduced into maize embryos by Agrobacterium-mediated transformation (Zhao et
al., 1999, US Patent 5,981,840). The three constructs were PHP17105 (Ubi:BAA:Cys(-)K-
trAPAO (C359S, C461S):PinII), PHP17108 (Ubi:Cys(-)K-trAPAO (C359S,
C461S):PinII), and PHP17110 (Ubi:APAO:PinII). In addition, PHP16543
(NOS:CRC:PinII-Ubi:MO-PAT:T35) was introduced as a negative control and PHP15258
(Ubi:APAO:PinII-Ubi:BAA:ESP1:PinII-P35S:PAT:T35S) was introduced as a
non-targeted positive control. One experiment with two replications was performed. Samples
were assayed both for APAO activity by TLC, as described previously, and for APAO
protein by Enzyme Linked Immunosorbent Assay (ELISA). For a discussion of ELISA
methods, please see, for example, Current Protocols in Molecular Biology, 2:11.1.1-11.3.4,
John Wiley & Sons, Inc. (Ausubel, et al., eds. 1994). The APAO ELISA is a capture format
assay for the quantitative determination of APAO protein in the presence of extracted maize
tissue protein. It was performed by co-incubation of biotinylated antibody with an extract
prepared from leaf, seed, or callus in phosphate buffered saline with 0.5% Tween-20®. The
detection of the antibody complex was accomplished through the added incubation of
streptavidin-alkaline phosphatase (Bio-Rad Life Sciences Products #19542-018), followed
by the addition of substrate (pNPP tablets, Sigma #104-105). The resultant color intensity
was quantified by determining optical density and was directly proportional to the amount
of APAO protein present in the sample extracts. The assay has no matrix effects at
1 µg/well or below for maize leaf, seed, or callus protein. The standard curve was spiked
with wild type extract at levels above L0 ng/well. The transient testing results are
summarized in the table below.



Transient Testing of APAO Constructs (6-8-2000)

Experiment        Rep                       Construct                                          APAO-TLC  APAO-ELISA (ppm)
negative control  none                      none                                               0         -2
4350.08.01        1                         php16543, as a (-) control                         0         -4
4350.08.02        1                         php15258, non-targeted APAO as a (+) control       3         out high
4350.08.03        1                         php17105, UBI-BAA::CYS(-)K-TR-APAO (C359S, C461S)  1         107
4350.08.04        1                         php17108, UBI-CYS(-)K-TR-APAO (C359S, C461S)       3         270
4350.08.05        1                         php17110, UBI-APAO                                 3         out high
4350.08.06        2                         php16543                                           0         -5
4350.08.07        2                         php15258                                           3         313
4350.08.08        2                         php17105                                           0         52
4350.08.09        2                         php17108                                           2         143
4350.08.10        2                         php17110                                           2         123
3477.27.01        transformed callus lines  php15258 as positive controls                      1         118
3477.27.02        transformed callus lines  php15258 as positive controls                      2         141
3477.27.03        transformed callus lines  php15258 as positive controls                      2         187
3477.27.04        transformed callus lines  php15258 as positive controls                      2         184



As can be seen in the Table above, the BAA-targeted APAO (PHP17105) did not
accumulate as much APAO as the non-BAA targeted counterpart (PHP17108). Although
not to be limited by theory, the lack of APAO protein accumulation rather than APAO
function may play a role in the lack of detectable APAO activity with the BAA-targeted
APAO construct. It appears that only when the APAO concentration exceeds 100 ppm can
APAO activity be seen by TLC. Nevertheless, the double Cys(-) mutant is active in maize
when expressed either cytosolically or extracellularly.
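
The relationship between ELISA concentration and TLC-detectable activity can be checked directly against the transient testing data. The sketch below is illustrative: the sample list is transcribed from the transient testing table (entries reported as "out high" are omitted because they have no numeric value), and the function names are hypothetical.

```python
# (sample, ELISA ppm, TLC score) for transient samples with a numeric ELISA value.
SAMPLES = [
    ("php16543 rep 1", -4, 0),
    ("php17105 rep 1", 107, 1),
    ("php17108 rep 1", 270, 3),
    ("php16543 rep 2", -5, 0),
    ("php15258 rep 2", 313, 3),
    ("php17105 rep 2", 52, 0),
    ("php17108 rep 2", 143, 2),
    ("php17110 rep 2", 123, 2),
]


def tlc_positive_at_or_below(samples, threshold_ppm):
    """Samples showing TLC activity despite an ELISA value at or below the threshold."""
    return [name for name, ppm, tlc in samples if tlc > 0 and ppm <= threshold_ppm]


def lowest_ppm_with_activity(samples):
    """Lowest ELISA value among samples with any TLC-detectable activity."""
    return min(ppm for _, ppm, tlc in samples if tlc > 0)


print(tlc_positive_at_or_below(SAMPLES, 100))
print(lowest_ppm_with_activity(SAMPLES))
```

No sample at or below 100 ppm shows TLC activity in this data set, and the lowest concentration with detectable activity is 107 ppm.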



EXAMPLE 16

Other APAO Polynucleotides From Exophiala spinifera and Rhinocladiella atrovirens

Using primers designed from the APAO isolated from Exophiala spinifera, ATCC
74269 (Table 15), three new APAO polynucleotides were isolated from Exophiala spinifera
(isolates ESP002 and ESP003), designated ESP002_C2, ESP002_C3 and ESP003_C12,
and three new APAO polynucleotides from Rhinocladiella atrovirens (isolate RAT011),
designated RAT011_C1, RAT011_C2, and RAT011_C4. The strains used to isolate the
polynucleotides are described below.



Isolate  Genus species               Source              FB1 degrader  APAO homologs isolated
ESP002   Exophiala spinifera         Palm, ATCC 26089    Yes           ESP002_c2 in pGEX4T1
                                                                       ESP002_c3 in pGEX4T1
ESP003   Exophiala spinifera         Maize seed          Yes           ESP003_c12 in pGEX4T1
RAT011   Rhinocladiella atrovirens   Maize seed          Yes           RAT011_c1 in pGEM11Zf+
                                                                       RAT011_c2 in pGEX4T1
                                                                       RAT011_c4 in pGEM11Zf+



Growth conditions and production of culture material

1. Streak 150 x 15 mm YPD plates with a glycerol aliquot of the above fungal
isolates.

2. Grow at 28° C in the dark until there is sufficient growth for inoculating
liquid medium, usually at least two weeks.

3. Mycelia and spores were scraped from the plates, or agar cubes were used, to
inoculate 50 ml YPD broth in 250 ml baffled flasks.

4. Flasks of culture material were grown at 28° C in the dark at ~125 rpm.

5. After sufficient growth was obtained the cultures were harvested by
pelleting the culture in 50 ml centrifuge tubes at 3400 rpm for 15 min.

6. The supernatant was discarded and the pellets were frozen at -20° C.

YPD broth and agar medium

Amount per liter:
Yeast Extract    10 g
Bactopeptone     20 g
Dextrose         0.5 g
Bactoagar        15 g (for agar media only)




DNA Isolation

The DNA was isolated according to a modified version of a plant CTAB DNA
extraction protocol (Saghai-Maroof MA, et al., Proc Natl Acad Sci, USA, 81:8014-8018
(1984)) as follows.

1. Place 0.2-0.5 g (dry weight) lyophilized fungal mycelium in a 50 ml
disposable centrifuge tube, break up mat with a spatula or glass rod. Shake
briefly.

2. Add 10 ml (per 0.5 g mat) of CTAB extraction buffer. Gently mix to wet
all the powdered mat.

3. Place in 65° C water bath for 30 minutes.

4. Cool. Add an equal volume of phenol:chloroform. Shake briefly to mix.

5. Centrifuge 20 minutes at 3400 rpm.

6. To the aqueous phase add an equal volume of chloroform:isoamyl alcohol
(24:1). Shake briefly to mix.

7. Centrifuge 15 minutes at 3400 rpm.

8. To aqueous phase add an equal volume of isopropanol.

9. Centrifuge for 30 minutes at 3400 rpm to pellet precipitated DNA.

10. Rinse DNA pellet with 70% ethanol.

11. Air dry pellet.

12. Resuspend pellet in 1-5 ml TE containing 20 µg/ml RNase A.

CTAB Extraction Buffer
0.1 M Tris, pH 7.5
1% CTAB (mixed hexadecyl trimethyl ammonium bromide)
0.7 M NaCl
10 mM EDTA
1% 2-mercaptoethanol
Add proteinase K to a final concentration of 0.3 mg/ml prior to use.



Primer Design

Primers used were gene specific primers based on the APAO polynucleotide sequence
(SEQ ID NO: 22) with restriction enzyme sites for cloning. The 5'-primer, 26194,
contained the restriction enzyme recognition site EcoRI. The complementary 3'-primer,
23259, contained the restriction enzyme recognition site NotI.

26194
5' ggggaattcATGGCACTTGCACCGAGCTACATCAATC 3' , 37-mer (SEQ ID NO: 34)

23259
5' gggGCGGCCGCCTATGCTGCTGGCACCAGGCTAG 3' , 34-mer (SEQ ID NO: 13)
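
The stated primer lengths and restriction sites can be confirmed with a short check. The primer sequences are those given above; the recognition sequences GAATTC (EcoRI) and GCGGCCGC (NotI) are the standard sites for these enzymes, and the helper names are hypothetical.

```python
PRIMERS = {
    "26194": "ggggaattcATGGCACTTGCACCGAGCTACATCAATC",
    "23259": "gggGCGGCCGCCTATGCTGCTGGCACCAGGCTAG",
}
# Standard recognition sequences for the two enzymes named in the text.
SITES = {"EcoRI": "GAATTC", "NotI": "GCGGCCGC"}


def sites_in(primer):
    """Names of the restriction sites present in a primer (case-insensitive)."""
    seq = primer.upper()
    return [name for name, site in SITES.items() if site in seq]


def reverse_complement(seq):
    """Reverse complement of a DNA sequence."""
    complement = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(complement[base] for base in reversed(seq.upper()))


for name, seq in PRIMERS.items():
    print(name, len(seq), sites_in(seq))
# Sense-strand view of the 3'-primer; it ends the coding region with a TAG stop.
print(reverse_complement(PRIMERS["23259"]))
```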

PCR conditions

1. The PCR cocktail (per 50 µl reaction, per 0.2 ml tube):

10 mM dNTPs                                1 µl
10X Advantage polymerase buffer            5 µl
HPLC water                                38 µl
10 µM primer 26194                         2 µl
10 µM primer 23259                         2 µl
50X Advantage polymerase mix (Clontech)    1 µl
Template, genomic DNA, 50 ng/µl            1 µl

2. Thermocycling conditions (MJ PTC-100 AgV Thermocycler):

Step 1    95° C    2 minutes
Step 2    95° C    30 seconds
Step 3    60° C    1 minute
Step 4    72° C    1 minute 30 seconds
Step 5    Go to step 2, 34X more
Step 6    72° C    5 minutes
Step 7    4° C     Hold
Step 8    End

3. PCR products were analyzed on a 1% LE-agarose, TAE plus ethidium
bromide gel. Bands of about 1900 bp were seen on the gel. The band was not
present in the no-DNA control reaction.
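
The cocktail volumes and cycling program above can be checked and scaled with a short calculation. This is a sketch under stated assumptions: the 10% pipetting overage, the exclusion of template from the shared mix, and the function names are illustrative choices, and the time estimate ignores temperature ramping and the final hold.

```python
# Volumes in µl per 50 µl reaction, transcribed from the PCR cocktail above.
COCKTAIL_UL = {
    "10 mM dNTPs": 1,
    "10X Advantage polymerase buffer": 5,
    "HPLC water": 38,
    "10 µM primer 26194": 2,
    "10 µM primer 23259": 2,
    "50X Advantage polymerase mix": 1,
    "Template genomic DNA, 50 ng/µl": 1,
}


def master_mix(n_reactions, overage=0.1):
    """Scaled volumes for a shared mix; template is added per tube, so it is excluded."""
    scale = n_reactions * (1 + overage)
    return {
        name: round(volume * scale, 2)
        for name, volume in COCKTAIL_UL.items()
        if not name.startswith("Template")
    }


def cycling_minutes(cycles=35):
    """Programmed time: 2 min start, 35 cycles of 30 s + 1 min + 1.5 min, 5 min finish."""
    per_cycle = 0.5 + 1.0 + 1.5
    return 2 + cycles * per_cycle + 5


print(sum(COCKTAIL_UL.values()))  # total volume per reaction in µl
print(master_mix(10))
print(cycling_minutes())
```

Step 5 repeats steps 2 to 4 a further 34 times, giving 35 cycles in total, which is the default used in `cycling_minutes`.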

Cloning Protocols

1. DNA was extracted from excised gel fragments using a QIAGEN Gel
Extraction Kit (Catalog number 28704, QIAGEN, Santa Clara, CA).

2. PCR fragments were digested with EcoRI and NotI to free up the sites for
cloning into EcoRI and NotI digested vector, either pGEX4T1 (Pharmacia)
or pGEM11Zf+ (Promega).

3. Digests were cleaned up and desalted using a QIAquick PCR Purification
Kit (Catalog number 28104).

4. Isolated fragment was quantified and checked for purity on a 1% LE-agarose,
TAE + ethidium bromide gel.

5. Fragments were ligated into compatible sites in either pGEX4T1
(Pharmacia) or pGEM11Zf+ (Promega).

6. After heat inactivation, Library Efficiency DH5α competent E. coli were
transformed with a small amount of the ligation reaction.

7. LB + carbenicillin, 50 µg/ml, plates were spread with an aliquot of the
transformation mix and grown overnight at 37° C.

8. Colonies were screened for full-length insert using a PCR miniprep method
utilizing vector primers flanking the multiple cloning region.

9. Positive clones were identified and overnight cultures grown for plasmid
isolation and verification by sequencing.

10. Positive clones are identified as follows:

DH5α:pGEX4T1:ESP002FL_c2 (from palm tree isolate)
DH5α:pGEX4T1:ESP002FL_c3 (from palm tree isolate)
DH5α:pGEX4T1:ESP003FL_c12 (from maize isolate)
DH5α:pGEM11Zf+:RAT011FL_c1 (from maize isolate)
DH5α:pGEM11Zf+:RAT011FL_c4 (from maize isolate)
DH5α:pGEX4T1:RAT011FL_c2 (from maize isolate)

* Important note: These are genomic clones containing two introns.

Sequence Results

Three APAO polynucleotides and related polypeptides were isolated from
Exophiala spinifera (isolates ESP002 and ESP003), designated ESP002_C2 (SEQ ID
NOS: 35 and 36), ESP002_C3 (SEQ ID NOS: 37 and 38) and ESP003_C12 (SEQ ID NOS:
39 and 40). Three APAO polynucleotides were isolated from Rhinocladiella atrovirens
(isolate RAT011), designated RAT011_C1 (SEQ ID NOS: 41 and 42), RAT011_C2 (SEQ
ID NOS: 43 and 44), and RAT011_C4 (SEQ ID NOS: 45 and 46). Introns were detected
by comparison of the genomic sequence with the cDNA sequence of APAO from E.
spinifera 2141.10 (SEQ ID NO: 22), and by identifying putative intron splice junctions in
the gap domains (Shah, et al., Journal of Molecular and Applied Genetics 2:111-126
(1983)).

Plasmids containing the polynucleotide sequences of the invention were deposited
with American Type Culture Collection (ATCC), Manassas, Virginia, and assigned
Accession No. 98812, 98813, 98814, 98815, 98816 (all deposited on July 15, 1998) and
PTA-32 (deposited on May 7, 1999). The deposits will be maintained under the terms of
the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for
the Purposes of Patent Procedure. The deposits were made merely as a convenience for
those of skill in the art and are not an admission that a deposit is required under 35 U.S.C.
§112.

Preliminary sequence results were entered into GCG, and nucleotide and protein
alignments were done in a pileup using a software program called Genedoc for shading and
homology comparisons (Nicholas, et al., EMBNEW.NEWS 4:14 (1997); or at the Internet
site http://www.cris.com/~Ketchup/genedoc.shtml). The first APAO (SEQ ID NO: 22)
sequence was included for comparison. Comparing the reference sequence SEQ ID NO:
22 to the other homologs, sequence identities range from 96 to 99% (identities are lower
since APAO introns were not included). Homologies are slightly higher comparing
Exophiala gene sequences. At the amino acid sequence level the comparison of the
reference sequence (SEQ ID NO: 23) to the other homologs yielded sequence identities of
approximately 97%.

All publications and patent applications in this specification are indicative of the 
level of ordinary skill in the art to which this invention pertains. All publications and 
patent applications are herein incorporated by reference to the same extent as if each 
individual publication or patent application was specifically and individually indicated by 

10 reference. 

The invention has been described with reference to various specific and preferred 
embodiments and techniques. However, it should be understood that many variations and 
modifications may be made while remaining within the spirit and scope of the invention. 
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WHAT IS CLAIMED IS: 



1. An isolated polynucleotide comprising an APAO encoding polynucleotide linked 
to a fumonisin esterase encoding polynucleotide, wherein the APAO encoding 
polynucleotide comprises a member selected from: 

a) a polynucleotide encoding a polypeptide selected from SEQ ID NOS: 6, 11, 23, 
33, 36, 38, 40, 42, 44, 46, 49, 51 and 53; 

b) a polynucleotide having at least 70% sequence identity to a polynucleotide 
selected from SEQ ID NOS: 5, 10, 22, 33, 35, 37, 39, 41, 43, 45, 48, 50 and 52; and 

c) a polynucleotide selected from SEQ ID NOS: 5, 10, 22, 32, 35, 37, 39, 41, 43, 
45, 48, 50 and 52. 

2. A recombinant expression cassette comprising a polynucleotide of claim 1 operably 
linked to a promoter. 

3. The recombinant expression cassette of claim 2 wherein the polynucleotide is 
operably linked to a plant signal sequence. 

4. A vector comprising the recombinant expression cassette of claim 2. 

5. A host cell comprising the recombinant expression cassette of claim 2. 

6. The host cell of claim 5 wherein the cell is a plant cell. 

7. The host cell of claim 6 wherein the plant cell is selected from the group consisting 
of maize, sorghum, wheat, tomato, soybean, alfalfa, sunflower, canola, cotton, barley, 
millet, and rice. 

8. A plant comprising a polynucleotide of claim 1. 

9. A seed from a plant of claim 7. 

10. An isolated polypeptide comprising a member selected from: 

a) a polypeptide comprising at least 70% sequence identity to a polypeptide 
selected from SEQ ID NOS: 6, 11, 23, 33, 36, 38, 40, 42, 44, 46, 49, 51 and 53; 

b) a polypeptide encoded by a polynucleotide having at least 70% sequence 
identity to a polynucleotide selected from SEQ ID NOS: 5, 10, 22, 32, 35, 37, 39, 
41, 43, 45, 48, 50 and 52; and 

c) a polypeptide selected from SEQ ID NOS: 6, 11, 23, 33, 36, 38, 40, 42, 44, 46, 
49, 51 and 53. 

11. The polynucleotide of claim 1 wherein the fumonisin esterase encoding 
polynucleotide is ESP1. 

12. The polynucleotide of claim 11 wherein the polynucleotide is set forth in SEQ ID 
NO: 24. 

13. The polynucleotide of claim 1 wherein the fumonisin esterase encoding 
polynucleotide is BEST1. 

14. The polynucleotide of claim 13 wherein the polynucleotide is set forth in SEQ ID 
NO: 26. 

15. A method of degrading fumonisin, a structurally related mycotoxin, a fumonisin 
breakdown product, or a breakdown product of a structurally related mycotoxin comprising 
the steps of: 

a) applying an APAO enzyme as a spray or wash; and 

b) under degradation conditions allowing sufficient time for the polypeptide to 
degrade the fumonisin, the structurally related mycotoxin, the fumonisin 
breakdown product, or the breakdown product of a structurally related mycotoxin. 

16. The method of claim 15 wherein the fumonisin or structurally related mycotoxin is 
present in harvested grain. 

17. The method of claim 15 wherein degradation occurs during processing of the 
harvested grain. 

18. The method of claim 17 wherein the harvested grain is to be used as animal feed. 

19. The method of claim 15 wherein degradation occurs in silage. 

20. The method of claim 15 wherein fumonisin esterase is also added at or before step 
(a). 

21. The method of claim 15 wherein the APAO enzyme is selected from: 

a) a polypeptide comprising at least 70% sequence identity to a polypeptide 
selected from SEQ ID NOS: 6, 11, 23, 33, 36, 38, 40, 42, 44, 46, 49, 51 and 53; 

b) a polypeptide encoded by a polynucleotide having at least 70% sequence 
identity to a polynucleotide selected from SEQ ID NOS: 5, 10, 22, 32, 35, 37, 39, 
41, 43, 45, 48, 50 and 52; and 

c) a polypeptide selected from SEQ ID NOS: 6, 11, 23, 36, 38, 40, 42, 44, 46, 49, 
51 and 53. 

22. A method of identifying transformed plant cells comprising the steps of: 

a) introducing into a plant cell at least one copy of an expression cassette 
comprising an APAO encoding polynucleotide; 

b) placing the plant cell on culture media containing an AP1 or a 
phytotoxic analog; and 

c) identifying transformed cells as the surviving cells in the culture. 

23. The method of claim 22 wherein the APAO encoding polynucleotide comprises a 
polynucleotide having at least 70% sequence identity to a polynucleotide selected from 
SEQ ID NOS: 5, 10, 22, 32, 35, 37, 39, 41, 43, 45, 48, and 50. 

24. The method of claim 22 wherein a fumonisin esterase encoding polynucleotide is 
also introduced into the plant cell. 

25. A method of detecting fumonisins or structurally related toxins, the method 
comprising: 

a) adding APAO enzymes to a sample containing fumonisin or a 
structurally related toxin; 

b) reacting the sample under conditions of time and temperature sufficient 
to convert the toxin to the corresponding oxidized or deaminated toxin; 
and 

c) detecting the hydrogen peroxide or ammonia produced. 

26. The method of claim 25 wherein the APAO enzyme is encoded by a polynucleotide 
having at least 70% sequence identity to a polynucleotide selected from SEQ ID NOS: 5, 
10, 22, 32, 35, 37, 39, 41, 43, 45, 48, and 50. 

27. The method of claim 25 wherein fumonisin esterase is added at or before step (a). 

28. A method of producing a plant capable of degrading fumonisin, a structurally 
related mycotoxin, a fumonisin breakdown product, or a breakdown product of a 
structurally related mycotoxin comprising the steps of: 

a) introducing into a plant cell at least one copy of an expression cassette 
comprising a polynucleotide encoding an APAO enzyme operably linked to a promoter; 
and 

b) under degradation conditions expressing the APAO enzyme for a time 
sufficient to degrade the fumonisin, the fumonisin breakdown product, the structurally 
related mycotoxin, AP1, or a breakdown product of a structurally related mycotoxin. 

29. The method of claim 28 wherein a polynucleotide encoding a fumonisin esterase is 
also introduced. 

30. The method of claim 28 wherein the APAO enzyme is encoded by a polynucleotide 
having at least 70% sequence identity to a polynucleotide selected from SEQ ID NOS: 5, 
10, 22, 32, 35, 37, 39, 41, 43, 45, 48, 50 and 52. 

31. The method of claim 28 wherein the plant cell is regenerated into a plant. 

32. The method of claim 28 wherein a fumonisin esterase encoding polynucleotide is 
also introduced. 

33. A host cell comprising an APAO encoding polynucleotide and a fumonisin esterase 
encoding polynucleotide. 

34. The host cell of claim 33 wherein the APAO encoding polynucleotide comprises a 
polynucleotide having at least 70% identity to a polynucleotide selected from SEQ ID 
NOS: 5, 10, 22, 32, 35, 37, 39, 41, 43, 45, 48, and 50. 

35. The host cell of claim 33 wherein the fumonisin esterase encoding polynucleotide 
is selected from ESP1 and BEST1. 

36. The host cell of claim 33 wherein the cell is a plant cell. 

37. The host cell of claim 36 wherein the cell is selected from maize, sorghum, wheat, 
tomato, soybean, alfalfa, sunflower, canola, cotton, and rice. 

38. The host cell of claim 37 wherein the plant cell is regenerated into a plant. 

39. A method of predicting possible mutagenesis sites on APAO comprising the steps 
of: 

a) developing a 3-dimensional model of APAO; and 

b) identifying sites on APAO to mutate by evaluating the likelihood that particular 
amino acids could contribute to misfolding. 

40. A 3-dimensional model of APAO generated by an automated modeling program. 

41. The model of claim 40 wherein the automated modeling program is Modeler. 

AMINO POLYOL AMINE OXIDASE POLYNUCLEOTIDES AND RELATED 
POLYPEPTIDES AND METHODS OF USE 



Abstract 

The present invention provides polynucleotides and related polypeptides of the 
enzyme APAO isolated from Exophiala spinifera and Rhinocladiella atrovirens. The 
polynucleotides may be mutated to remove glycosylation sites and cysteine residues. 
Additionally, the present invention provides recombinant expression cassettes, host cells, 
transgenic plants, and transgenic seed. The present invention also provides for 
polynucleotides containing both APAO and a fumonisin esterase. In addition, the present 
invention provides methods for producing the APAO enzyme in both prokaryotic and 
eukaryotic systems, methods for detecting fumonisins, and methods for identifying 
transformed plant cells. Methods for degrading fungal toxins in plants, grain, grain 
processing, silage, food crops and in animal feed are also disclosed. 

SEQUENCE LISTING 



<110> Duvick, Jonathan P. 
Gilliam, Jacob T. 
Maddox, Joyce R. 
Rao, Aragula Gururaj 
Crasta, Oswald R. 
Folkerts, Otto 

<120> Amino Polyol Amine Oxidase Polynucleotides and Related 
Polypeptides and Methods of Use 



<130> 1134R 

<150> US 60/092,936 
<151> 1998-07-25 

<150> US 60/135,391 
<151> 1999-05-21 

<150> US 09/352,159 
<151> 1999-07-12 

<150> US 09/352,168 
<151> 1999-07-12 

<160> 53 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 372 
<212> DNA 

<213> Exophiala spinifera 
<220> 

<221> misc_feature 
<222> (346)...(346) 
<223> n = A, T, C or G 

<400> 1 

gggccccggc gttctcgtag gctgcgcgga gttggtccca gacagacttt tgtcgtacct 60 

gcttggactg ttgggaccac ttccgtcccg ggtctccgac catgaaacag gtaatggacc 120 

attgtcgatc gacgtcgatg ctggtatctc tggcaaatga gatggggtca cagctcgatt 180 

ggaggacgcc cgagaagcct tgttcgcgcc accacggctt gtcccatacg aagactatct 240 

tgctatagta gcccaggata gaattttccg ccaatgcttg cttctcggcg ggaagaggtg 300 

gtgaaaatgt caaggtggga tacaaggttg tcggtaacga aaccancacc tttttgcttc 360 

ggaacacggc gc 372 

<210> 2 
<211> 182 
<212> DNA 

<213> Exophiala spinifera 
<400> 2 

gaattttccg ccaatgcttg cttctcggcg ggaagaggtg gtgaaaatgt caaggtggga 60 
tacaaggttg tcggtaacga aaccaccacc tttttgcttc ggaacacggc gcccgaggcc 120 
gatcgtactg tacagccgga tgccgactgc tcaatttcag cgacgggggt gttgaggtgc 180 
ac 182 



<210> 3 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Designed oligonucleotide for 3' RACE, N21965 



<400> 3 

tggtttcgtt accgacaacc ttgtatccc 29 

<210> 4 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Designed oligonucleotide for 5' RACE, N21968 
<400> 4 

gagttggtcc cagacagact tttgtcgt 28 



<210> 5 
<211> 1389 
<212> DNA 

<213> Exophiala spinifera 

<220> 
<221> CDS 

<222> (1)...(1386) 
<400> 5 

gac aac gtt gcg gac gtg gta gtg gtg ggc gct ggc ttg agc ggt ttg 48 
Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly Leu 
1 5 10 15 

gag acg gca cgc aaa gtc cag gcc gcc ggt ctg tcc tgc ctc gtt ctt 96 
Glu Thr Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val Leu 
20 25 30 

gag gcg atg gat cgt gta ggg gga aag act ctg agc gta caa tcg ggt 144 
Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser Gly 
35 40 45 

ccc ggc agg acg act atc aac gac ctc ggc gct gcg tgg atc aat gac 192 
Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp 
50 55 60 

agc aac caa agc gaa gta tcc aga ttg ttt gaa aga ttt cat ttg gag 240 
Ser Asn Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu Glu 
65 70 75 80 

ggc gag ctc cag agg acg act gga aat tca atc cat caa gca caa gac 288 
Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln Asp 
85 90 95 

ggt aca acc act aca gct cct tat ggt gac tcc ttg ctg agc gag gag 336 
Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu 
100 105 110 

gtt gca agt gca ctt gcg gaa ctc ctc ccc gta tgg tct cag ctg atc 384 
Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu Ile 
115 120 125 

gaa gag cat agc ctt caa gac ctc aag gcg agc cct cag gcg aag cgg 432 
Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg 
130 135 140 

ctc gac agt gtg agc ttc gcg cac tac tgt gag aag gaa cta aac ttg 480 
Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn Leu 
145 150 155 160 

cct gct gtt ctc ggc gta gca aac cag atc aca cgc gct ctg ctc ggt 528 
Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly 
165 170 175 

gtg gaa gcc cac gag atc agc atg ctt ttt ctc acc gac tac atc aag 576 
Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys 
180 185 190 

agt gcc acc ggt ctc agt aat att ttc tcg gac aag aaa gac ggc ggg 624 
Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly 
195 200 205 

cag tat atg cga tgc aaa aca ggt atg cag tcg att tgc cat gcc atg 672 
Gln Tyr Met Arg Cys Lys Thr Gly Met Gln Ser Ile Cys His Ala Met 
210 215 220 

tca aag gaa ctt gtt cca ggc tca gtg cac ctc aac acc ccc gtc gct 720 
Ser Lys Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val Ala 
225 230 235 240 

gaa att gag cag tcg gca tcc ggc tgt aca gta cga tcg gcc tcg ggc 768 
Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser Gly 
245 250 255 

gcc gtg ttc cga agc aaa aag gtg gtg gtt tcg tta ccg aca acc ttg 816 
Ala Val Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr Leu 
260 265 270 

tat ccc acc ttg aca ttt tca cca cct ctt ccc gcc gag aag caa gca 864 
Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala 
275 280 285 

ttg gcg gaa aat tct atc ctg ggc tac tat agc aag ata gtc ttc gta 912 
Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val 
290 295 300 

tgg gac aag ccg tgg tgg cgc gaa caa ggc ttc tcg ggc gtc ctc caa 960 
Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu Gln 
305 310 315 320 

tcg agc tgt gac ccc atc tca ttt gcc aga gat acc agc atc gac gtc 1008 
Ser Ser Cys Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp Val 
325 330 335 

gat cga caa tgg tcc att acc tgt ttc atg gtc gga gac ccg gga cgg 1056 
Asp Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly Arg 
340 345 350 

aag tgg tcc caa cag tcc aag cag gta cga caa aag tct gtc tgg gac 1104 
Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp Asp 
355 360 365 

caa ctc cgc gca gcc tac gag aac gcc ggg gcc caa gtc cca gag ccg 1152 
Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu Pro 
370 375 380 

gcc aac gtg ctc gaa atc gag tgg tcg aag cag cag tat ttc caa gga 1200 
Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly 
385 390 395 400 

gct ccg agc gcc gtc tat ggg ctg aac gat ctc atc aca ctg ggt tcg 1248 
Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser 
405 410 415 

gcg ctc aga acg ccg ttc aag agt gtt cat ttc gtt gga acg gag acg 1296 
Ala Leu Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu Thr 
420 425 430 

tct tta gtt tgg aaa ggg tat atg gaa ggg gcc ata cga tcg ggt caa 1344 
Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln 
435 440 445 

cga ggt gct gca gaa gtt gtg gct agc ctg gtg cca gca gca 1386 
Arg Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala 
450 455 460 

tag 1389 

<210> 6 
<211> 462 
<212> PRT 

<213> Exophiala spinifera 
<400> 6 

Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly Leu 
1 5 10 15 

Glu Thr Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val Leu 
20 25 30 

Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser Gly 
35 40 45 

Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp 
50 55 60 

Ser Asn Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu Glu 
65 70 75 80 

Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln Asp 
85 90 95 

Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu 
100 105 110 

Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu Ile 
115 120 125 

Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg 
130 135 140 

Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn Leu 
145 150 155 160 

Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly 
165 170 175 

Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys 
180 185 190 

Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly 
195 200 205 

Gln Tyr Met Arg Cys Lys Thr Gly Met Gln Ser Ile Cys His Ala Met 
210 215 220 

Ser Lys Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val Ala 
225 230 235 240 

Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser Gly 
245 250 255 

Ala Val Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr Leu 
260 265 270 

Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala 
275 280 285 

Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val 
290 295 300 

Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu Gln 
305 310 315 320 

Ser Ser Cys Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp Val 
325 330 335 

Asp Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly Arg 
340 345 350 

Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp Asp 
355 360 365 

Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu Pro 
370 375 380 

Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly 
385 390 395 400 

Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser 
405 410 415 

Ala Leu Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu Thr 
420 425 430 

Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln 
435 440 445 

Arg Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala 
450 455 460 







<210> 7 
<211> 1442 
<212> DNA 

<213> Exophiala spinifera 

<220> 
<221> CDS 
<222> (1)...(646) 

<221> intron 
<222> (647)...(699) 

<221> CDS 
<222> (700)...(1439) 

<400> 7 
gac aac gtt gcg gac gtg gta gtg gtg ggc gct ggc ttg agc ggt ttg 48 
Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly Leu 
1 5 10 15 

gag acg gca cgc aaa gtc cag gcc gcc ggt ctg tcc tgc ctc gtt ctt 96 
Glu Thr Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val Leu 
20 25 30 

gag gcg atg gat cgt gta ggg gga aag act ctg agc gta caa tcg ggt 144 
Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser Gly 
35 40 45 

ccc ggc agg acg act atc aac gac ctc ggc gct gcg tgg atc aat gac 192 
Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp 
50 55 60 

agc aac caa agc gaa gta tcc aga ttg ttt gaa aga ttt cat ttg gag 240 
Ser Asn Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu Glu 
65 70 75 80 

ggc gag ctc cag agg acg act gga aat tca atc cat caa gca caa gac 288 
Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln Asp 
85 90 95 

ggt aca acc act aca gct cct tat ggt gac tcc ttg ctg agc gag gag 336 
Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu 
100 105 110 

gtt gca agt gca ctt gcg gaa ctc ctc ccc gta tgg tct cag ctg atc 384 
Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu Ile 
115 120 125 

gaa gag cat agc ctt caa gac ctc aag gcg agc cct cag gcg aag cgg 432 
Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg 
130 135 140 

ctc gac agt gtg agc ttc gcg cac tac tgt gag aag gaa cta aac ttg 480 
Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn Leu 
145 150 155 160 

cct gct gtt ctc ggc gta gca aac cag atc aca cgc gct ctg ctc ggt 528 
Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly 
165 170 175 

gtg gaa gcc cac gag atc agc atg ctt ttt ctc acc gac tac atc aag 576 
Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys 
180 185 190 

agt gcc acc ggt ctc agt aat att ttc tcg gac aag aaa gac ggc ggg 624 
Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly 
195 200 205 

cag tat gtg cga tgc aaa aca g gtgcgtgtgg tgtcgtctca ggtgggggac 676 
Gln Tyr Val Arg Cys Lys Thr 
210 215 

tcgtttctca gtggtcattc cag gt atg cag tcg att tgc cat gcc atg tca 728 
Gly Met Gln Ser Ile Cys His Ala Met Ser 
220 225 

aag gaa ctt gtt cca ggc tca gtg cac ctc aac acc ccc gtc gct gaa 776 
Lys Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val Ala Glu 
230 235 240 

att gag cag tcg gca tcc ggc tgt aca gta cga tcg gcc tcg ggc gcc 824 
Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser Gly Ala 
245 250 255 

gtg ttc cga agc aaa aag gtg gtg gtt tcg tta ccg aca acc ttg tat 872 
Val Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr Leu Tyr 
260 265 270 

ccc acc ttg aca ttt tca cca cct ctt ccc gcc gag aag caa gca ttg 920 
Pro Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu 
275 280 285 

gcg gaa aat tct atc ctg ggc tac tat agc aag ata gtc ttc gta tgg 968 
Ala Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp 
290 295 300 305 

gac aag ccg tgg tgg cgc gaa caa ggc ttc tcg ggc gtc ctc caa tcg 1016 
Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu Gln Ser 
310 315 320 

agc tgt gac ccc atc tca ttt gcc aga gat acc agc atc gac gtc gat 1064 
Ser Cys Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp Val Asp 
325 330 335 

cga caa tgg tcc att acc tgt ttc atg gtc gga gac ccg gga cgg aag 1112 
Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly Arg Lys 
340 345 350 

tgg tcc caa cag tcc aag cag gta cga caa aag tct gtc tgg gac caa 1160 
Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp Asp Gln 
355 360 365 

ctc cgc gca gcc tac gag aac gcc ggg gcc caa gtc cca gag ccg gcc 1208 
Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu Pro Ala 
370 375 380 385 

aac gtg ctc gaa atc gag tgg tcg aag cag cag tat ttc caa gga gct 1256 
Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala 
390 395 400 

ccg agc gcc gtc tat ggg ctg aac gat ctc atc aca ctg ggt tcg gcg 1304 
Pro Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala 
405 410 415 

ctc aga acg ccg ttc aag agt gtt cat ttc gtt gga acg gag acg tct 1352 
Leu Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu Thr Ser 
420 425 430 

tta gtt tgg aaa ggg tat atg gaa ggg gcc ata cga tcg ggt caa cga 1400 
Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln Arg 
435 440 445 

ggt gct gca gaa gtt gtg gct agc ctg gtg cca gca gca tag 1442 
Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala 
450 455 460 



<210> 8 
<211> 462 
<212> PRT 

<213> Exophiala spinifera 

<400> 8 

Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly Leu 
1 5 10 15 

Glu Thr Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val Leu 
20 25 30 

Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser Gly 
35 40 45 

Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp 
50 55 60 

Ser Asn Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu Glu 
65 70 75 80 

Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln Asp 
85 90 95 

Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu 
100 105 110 

Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu Ile 
115 120 125 

Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg 
130 135 140 

Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn Leu 
145 150 155 160 

Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly 
165 170 175 

Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys 
180 185 190 

Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly 
195 200 205 

Gln Tyr Val Arg Cys Lys Thr Gly Met Gln Ser Ile Cys His Ala Met 
210 215 220 

Ser Lys Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val Ala 
225 230 235 240 

Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser Gly 
245 250 255 

Ala Val Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr Leu 
260 265 270 

Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala 
275 280 285 

Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val 
290 295 300 

Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu Gln 
305 310 315 320 

Ser Ser Cys Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp Val 
325 330 335 

Asp Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly Arg 
340 345 350 

Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp Asp 
355 360 365 

Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu Pro 
370 375 380 

Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly 
385 390 395 400 

Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser 
405 410 415 

Ala Leu Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu Thr 
420 425 430 

Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln 
435 440 445 

Arg Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala 
450 455 460 



<210> 9 
<211> 458 
<212> PRT 

<213> Exophiala spinifera 



<400> 9 



Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly Leu 
1 5 10 15 

Glu Thr Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val Leu 
20 25 30 

Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser Gly 
35 40 45 

Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp 
50 55 60 

Ser Asn Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu Glu 
65 70 75 80 

Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln Asp 
85 90 95 

Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu 
100 105 110 

Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu Ile 
115 120 125 

Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg 
130 135 140 

Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn Leu 
145 150 155 160 

Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly 
165 170 175 

Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys 
180 185 190 

Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly 
195 200 205 

Gln Tyr Val Arg Cys Lys Thr Gly Ala Cys Gly Val Val Ser Gly Gly 
210 215 220 

Gly Leu Val Ser Gln Trp Ser Phe Gln Val Cys Ser Arg Phe Ala Met 
225 230 235 240 

Pro Cys Gln Arg Asn Leu Phe Gln Ala Gln Cys Thr Ser Thr Pro Pro 
245 250 255 

Ser Leu Lys Leu Ser Ser Arg His Pro Ala Val Gln Tyr Asp Arg Pro 
260 265 270 

Arg Ala Pro Cys Ser Glu Ala Lys Arg Trp Trp Phe Arg Tyr Arg Gln 
275 280 285 

Pro Cys Ile Pro Pro His Phe His His Leu Phe Pro Pro Arg Ser Lys 
290 295 300 

His Trp Arg Lys Ile Leu Ser Trp Ala Thr Ile Ala Arg Ser Ser Tyr 
305 310 315 320 

Gly Thr Ser Arg Gly Gly Ala Asn Lys Ala Ser Arg Ala Ser Ser Asn 
325 330 335 

Arg Ala Val Thr Pro Ser His Leu Pro Glu Ile Pro Ala Ser Thr Ser 
340 345 350 

Ile Asp Asn Gly Pro Leu Pro Val Ser Trp Ser Glu Thr Arg Asp Gly 
355 360 365 

Ser Gly Pro Asn Ser Pro Ser Arg Tyr Asp Lys Ser Leu Ser Gly Thr 
370 375 380 

Asn Ser Ala Gln Pro Thr Arg Thr Pro Gly Pro Lys Ser Gln Ser Arg 
385 390 395 400 

Pro Thr Cys Ser Lys Ser Ser Gly Arg Ser Ser Ser Ile Ser Lys Glu 
405 410 415 

Leu Arg Ala Pro Ser Met Gly Thr Ile Ser Ser His Trp Val Arg Arg 
420 425 430 

Ser Glu Arg Arg Ser Arg Val Phe Ile Ser Leu Glu Arg Arg Arg Leu 
435 440 445 

Phe Gly Lys Gly Ile Trp Lys Gly Pro Tyr 
450 455 

<210> 10 
<211> 1392 
<212> DNA 

<213> Exophiala spinifera 

<220> 

<221> CDS 

<222> (1)...(1389) 

<221> misc_feature 
<222> (1)...(3) 

<223> Extra lysine in K: trAPAO 
<400> 10 

aaa gac aac gtt gcg gac gtg gta gtg gtg ggc gct ggc ttg agc ggt 48 
Lys Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly 
1 5 10 15 

ttg gag acg gca cgc aaa gtc cag gcc gcc ggt ctg tcc tgc ctc gtt 96 
Leu Glu Thr Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val 
20 25 30 

ctt gag gcg atg gat cgt gta ggg gga aag act ctg agc gta caa tcg 144 
Leu Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser 
35 40 45 

ggt ccc ggc agg acg act atc aac gac ctc ggc gct gcg tgg atc aat 192 
Gly Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn 
50 55 60 

gac agc aac caa agc gaa gta tcc aga ttg ttt gaa aga ttt cat ttg 240 
Asp Ser Asn Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu 
65 70 75 80 

gag ggc gag ctc cag agg acg act gga aat tca atc cat caa gca caa 288 
Glu Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln 
85 90 95 

gac ggt aca acc act aca gct cct tat ggt gac tcc ttg ctg agc gag 336 
Asp Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu 
100 105 110 

gag gtt gca agt gca ctt gcg gaa ctc ctc ccc gta tgg tct cag ctg 384 
Glu Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu 
115 120 125 

atc gaa gag cat agc ctt caa gac ctc aag gcg agc cct cag gcg aag 432 
Ile Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys 
130 135 140 

cgg ctc gac agt gtg agc ttc gcg cac tac tgt gag aag gaa cta aac 480 
Arg Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn 
145 150 155 160 

ttg cct gct gtt ctc ggc gta gca aac cag atc aca cgc gct ctg ctc 528 
Leu Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu 
165 170 175 

ggt gtg gaa gcc cac gag atc agc atg ctt ttt ctc acc gac tac atc 576 
Gly Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile 
180 185 190 

aag agt gcc acc ggt ctc agt aat att ttc tcg gac aag aaa gac ggc 624 
Lys Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly 
195 200 205 

ggg cag tat atg cga tgc aaa aca ggt atg cag tcg att tgc cat gcc 672 
Gly Gln Tyr Met Arg Cys Lys Thr Gly Met Gln Ser Ile Cys His Ala 
210 215 220 

atg tca aag gaa ctt gtt cca ggc tca gtg cac ctc aac acc ccc gtc 720 
Met Ser Lys Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val 
225 230 235 240 

gct gaa att gag cag tcg gca tcc ggc tgt aca gta cga tcg gcc tcg 768 
Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser 
245 250 255 

ggc gcc gtg ttc cga agc aaa aag gtg gtg gtt tcg tta ccg aca acc 816 
Gly Ala Val Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr 
260 265 270 

ttg tat ccc acc ttg aca ttt tca cca cct ctt ccc gcc gag aag caa 864 
Leu Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln 
275 280 285 

gca ttg gcg gaa aat tct atc ctg ggc tac tat agc aag ata gtc ttc 912 
Ala Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe 
290 295 300 

gta tgg gac aag ccg tgg tgg cgc gaa caa ggc ttc tcg ggc gtc ctc 960 
Val Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu 
305 310 315 320 

caa tcg agc tgt gac ccc atc tca ttt gcc aga gat acc agc atc gac 1008 
Gln Ser Ser Cys Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp 
325 330 335 

gtc gat cga caa tgg tcc att acc tgt ttc atg gtc gga gac ccg gga 1056 
Val Asp Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly 
340 345 350 

cgg aag tgg tcc caa cag tcc aag cag gta cga caa aag tct gtc tgg 1104 
Arg Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp 
355 360 365 

gac caa ctc cgc gca gcc tac gag aac gcc ggg gcc caa gtc cca gag 1152 
Asp Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu 
370 375 380 

ccg gcc aac gtg ctc gaa atc gag tgg tcg aag cag cag tat ttc caa 1200 
Pro Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln 
385 390 395 400 

gga gct ccg agc gcc gtc tat ggg ctg aac gat ctc atc aca ctg ggt 1248 
Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly 
405 410 415 

tcg gcg ctc aga acg ccg ttc aag agt gtt cat ttc gtt gga acg gag 1296 
Ser Ala Leu Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu 
420 425 430 

acg tct tta gtt tgg aaa ggg tat atg gaa ggg gcc ata cga tcg ggt 1344 
Thr Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly 
435 440 445 

caa cga ggt gct gca gaa gtt gtg gct agc ctg gtg cca gca gca 1389 
Gln Arg Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala 
450 455 460 

tag 1392 

<210> 11 
<211> 463 
<212> PRT 

<213> Exophiala spinifera 
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Ala 
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Glu 
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Ser 
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Leu 
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Asp 
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<210> 12 



<211> 34
<212> DNA
<213> Artificial Sequence

<220>
<223> Primer sequence designed for cloning DNA into
expression vectors, N23256

<400> 12
ggggaattca aagacaacgt tgcggacgtg gtag 34



<210> 13 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence designed for cloning DNA into 
expression vectors, N23259 



<400> 13 

ggggcggccg cctatgctgc tggcaccagg ctag 34 
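
As a rough illustration of how the two cloning primers above can be inspected, the following Python sketch computes a reverse complement and locates restriction-site motifs in each primer. Only the primer strings come from the listing. The helper names are hypothetical, and reading the `gaattc` and `gcggccgc` motifs as EcoRI and NotI recognition sequences is an editorial observation, not a statement made in the listing. The 3' end of SEQ ID NO: 16 used for comparison is taken as read after OCR repair.

```python
# Primer strings copied from SEQ ID NO: 12 and 13 (spaces are listing formatting only).
SEQ12 = "ggggaattca aagacaacgt tgcggacgtg gtag".replace(" ", "")  # N23256
SEQ13 = "ggggcggccg cctatgctgc tggcaccagg ctag".replace(" ", "")  # N23259

# Assumption: these motifs are the well-known EcoRI and NotI recognition sequences.
ECORI_SITE = "gaattc"
NOTI_SITE = "gcggccgc"

# Assumption: last 32 nt of SEQ ID NO: 16 as read after OCR repair.
SEQ16_TAIL = "gctagcctggtgccagcagcataggcggccgc"

_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def site_offset(primer: str, site: str) -> int:
    """Return the 0-based offset of `site` in `primer`, or -1 if absent."""
    return primer.find(site)


if __name__ == "__main__":
    print(len(SEQ12), len(SEQ13))
    print(site_offset(SEQ12, ECORI_SITE), site_offset(SEQ13, NOTI_SITE))
    # The reverse primer, minus its three 5' padding bases, should match
    # the 3' end of the coding strand it anneals to.
    print(SEQ16_TAIL.endswith(reverse_complement(SEQ13)[:-3]))
```

Under these assumptions, both motifs begin after a four-base `gggg` pad, and the reverse complement of SEQ ID NO: 13 overlaps the stop codon and downstream bases at the 3' end of SEQ ID NO: 16.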

<210> 14 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Designed oligonucleotide for 3' RACE, N21965 
<400> 14 

tggtttcgtt accgacaacc ttgtatccc 29

<210> 15 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Designed oligonucleotide for 5' RACE, N21968
<400> 15

gagttggtcc cagacagact tttgtcgt 28

<210> 16 
<211> 1673 
<212> DNA 

<213> Exophiala spinifera 
<220>
<221> sig_peptide
<222> (1)...(267)
<223> yeast alpha mating factor secretion signal.

<221> CDS
<222> (1)...(1662)

<400> 16
atg aga ttt cct tca att ttt act gct gtt tta ttc gca gca tcc tcc 48
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
-85 -80 -75



gca tta gct gct cca gtc aac act aca aca gaa gat gaa acg gca caa 96
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
-70 -65 -60

att ccg gct gaa gct gtc atc ggt tac tca gat tta gaa ggg gat ttc 144
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
-55 -50 -45

gat gtt gct gtt ttg cca ttt tcc aac agc aca aat aac ggg tta ttg 192
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
-40 -35 -30

ttt ata aat act act att gcc agc att gct gct aaa gaa gaa ggg gta 240
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
-25 -20 -15 -10

tct ctc gag aaa aga gag gct gaa gct gaa ttc aaa gac aac gtt gcg 288
Ser Leu Glu Lys Arg Glu Ala Glu Ala Glu Phe Lys Asp Asn Val Ala
-5 1 5

gac gtg gta gtg gtg ggc gct ggc ttg agc ggt ttg gag acg gca cgc 336
Asp Val Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg
10 15 20

aaa gtc cag gcc gcc ggt ctg tcc tgc ctc gtt ctt gag gcg atg gat 384
Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp
25 30 35

cgt gta ggg gga aag act ctg agc gta caa tcg ggt ccc ggc agg acg 432
Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr
40 45 50 55

act atc aac gac ctc ggc gct gcg tgg atc aat gac agc aac caa agc 480
Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser
60 65 70

gaa gta tcc aga ttg ttt gaa aga ttt cat ttg gag ggc gag ctc cag 528
Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln
75 80 85

agg acg act gga aat tca atc cat caa gca caa gac ggt aca acc act 576
Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr
90 95 100

aca gct cct tat ggt gac tcc ttg ctg agc gag gag gtt gca agt gca 624
Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala
105 110 115

ctt gcg gaa ctc ctc ccc gta tgg tct cag ctg atc gaa gag cat agc 672
Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu His Ser
120 125 130 135

ctt caa gac ctc aag gcg agc cct cag gcg aag cgg ctc gac agt gtg 720
Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val
140 145 150

agc ttc gcg cac tac tgt gag aag gaa cta aac ttg cct gct gtt ctc 768
Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn Leu Pro Ala Val Leu
155 160 165

ggc gta gca aac cag atc aca cgc gct ctg ctc ggt gtg gaa gcc cac 816
Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His
170 175 180

gag atc agc atg ctt ttt ctc acc gac tac atc aag agt gcc acc ggt 864
Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly
185 190 195

ctc agt aat att ttc tcg gac aag aaa gac ggc ggg cag tat atg cga 912
Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg
200 205 210 215

tgc aaa aca ggt atg cag tcg att tgc cat gcc atg tca aag gaa ctt 960
Cys Lys Thr Gly Met Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu
220 225 230

gtt cca ggc tca gtg cac ctc aac acc ccc gtc gct gaa att gag cag 1008
Val Pro Gly Ser Val His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln
235 240 245

tcg gca tcc ggc tgt aca gta cga tcg gcc tcg ggc gcc gtg ttc cga 1056
Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser Gly Ala Val Phe Arg
250 255 260

agc aaa aag gtg gtg gtt tcg tta ccg aca acc ttg tat ccc acc ttg 1104
Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu
265 270 275

aca ttt tca cca cct ctt ccc gcc gag aag caa gca ttg gcg gaa aat 1152
Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn
280 285 290 295

tct atc ctg ggc tac tat agc aag ata gtc ttc gta tgg gac aag ccg 1200
Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp Lys Pro
300 305 310

tgg tgg cgc gaa caa ggc ttc tcg ggc gtc ctc caa tcg agc tgt gac 1248
Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp
315 320 325

ccc atc tca ttt gcc aga gat acc agc atc gac gtc gat cga caa tgg 1296
Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp Val Asp Arg Gln Trp
330 335 340

tcc att acc tgt ttc atg gtc gga gac ccg gga cgg aag tgg tcc caa 1344
Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln
345 350 355

cag tcc aag cag gta cga caa aag tct gtc tgg gac caa ctc cgc gca 1392
Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala
360 365 370 375

gcc tac gag aac gcc ggg gcc caa gtc cca gag ccg gcc aac gtg ctc 1440
Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu
380 385 390

gaa atc gag tgg tcg aag cag cag tat ttc caa gga gct ccg agc gcc 1488
Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala
395 400 405

gtc tat ggg ctg aac gat ctc atc aca ctg ggt tcg gcg ctc aga acg 1536
Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr
410 415 420

ccg ttc aag agt gtt cat ttc gtt gga acg gag acg tct tta gtt tgg 1584
Pro Phe Lys Ser Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp
425 430 435

aaa ggg tat atg gaa ggg gcc ata cga tcg ggt caa cga ggt gct gca 1632
Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala
440 445 450 455

gaa gtt gtg gct agc ctg gtg cca gca gca taggcggccg c 1673
Glu Val Val Ala Ser Leu Val Pro Ala Ala
460 465



<210> 17 

<211> 554 

<212> PRT 

<213> Exophiala spinifera 
<220> 

<221> SIGNAL
<222> (1)...(89)

<400> 17
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
-85 -80 -75

Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
-70 -65 -60

Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
-55 -50 -45

Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
-40 -35 -30

Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
-25 -20 -15 -10

Ser Leu Glu Lys Arg Glu Ala Glu Ala Glu Phe Lys Asp Asn Val Ala
-5 1 5

Asp Val Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg
10 15 20

Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp
25 30 35

Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr
40 45 50 55

Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser
60 65 70

Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln
75 80 85

Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr
90 95 100

Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala
105 110 115

Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu His Ser
120 125 130 135

Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val
140 145 150

Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn Leu Pro Ala Val Leu
155 160 165

Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His
170 175 180

Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly
185 190 195
Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg
200 205 210 215

Cys Lys Thr Gly Met Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu
220 225 230

Val Pro Gly Ser Val His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln
235 240 245

Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser Gly Ala Val Phe Arg
250 255 260

Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu
265 270 275

Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn
280 285 290 295

Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp Lys Pro
300 305 310

Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp
315 320 325

Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp Val Asp Arg Gln Trp
330 335 340

Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln
345 350 355

Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala
360 365 370 375

Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu
380 385 390

Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala
395 400 405

Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr
410 415 420

Pro Phe Lys Ser Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp
425 430 435

Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala
440 445 450 455

Glu Val Val Ala Ser Leu Val Pro Ala Ala
460 465

<210> 18 
<211> 2079 
<212> DNA 
<213> Unknown 

<220> 

<223> GST:K:trAPAO 2079 nt. Translation starting at nt 1
- 687, gst fusion + polylinker; 688-2076,
K:trAPAO, extra lysine underlined; 2077-2079, stop
codon. For bacterial expression.

<221> CDS
<222> (1)...(2076)

<221> misc_feature
<222> (1)...(687)
<223> gst fusion + polylinker

<221> misc_feature
<222> (688)...(2076)
<223> K:trAPAO

<221> misc_feature
<222> (688)...(690)
<223> Extra lysine

<400> 18

atg tcc cct ata cta ggt tat tgg aaa att aag ggc ctt gtg caa ccc 48
Met Ser Pro Ile Leu Gly Tyr Trp Lys Ile Lys Gly Leu Val Gln Pro
1 5 10 15

act cga ctt ctt ttg gaa tat ctt gaa gaa aaa tat gaa gag cat ttg 96
Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His Leu
20 25 30

tat gag cgc gat gaa ggt gat aaa tgg cga aac aaa aag ttt gaa ttg 144
Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe Glu Leu
35 40 45

ggt ttg gag ttt ccc aat ctt cct tat tat att gat ggt gat gtt aaa 192
Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr Ile Asp Gly Asp Val Lys
50 55 60

tta aca cag tct atg gcc atc ata cgt tat ata gct gac aag cac aac 240
Leu Thr Gln Ser Met Ala Ile Ile Arg Tyr Ile Ala Asp Lys His Asn
65 70 75 80

atg ttg ggt ggt tgt cca aaa gag cgt gca gag att tca atg ctt gaa 288
Met Leu Gly Gly Cys Pro Lys Glu Arg Ala Glu Ile Ser Met Leu Glu
85 90 95

gga gcg gtt ttg gat att aga tac ggt gtt tcg aga att gca tat agt 336
Gly Ala Val Leu Asp Ile Arg Tyr Gly Val Ser Arg Ile Ala Tyr Ser
100 105 110

aaa gac ttt gaa act ctc aaa gtt gat ttt ctt agc aag cta cct gaa 384
Lys Asp Phe Glu Thr Leu Lys Val Asp Phe Leu Ser Lys Leu Pro Glu
115 120 125

atg ctg aaa atg ttc gaa gat cgt tta tgt cat aaa aca tat tta aat 432
Met Leu Lys Met Phe Glu Asp Arg Leu Cys His Lys Thr Tyr Leu Asn
130 135 140

ggt gat cat gta acc cat cct gac ttc atg ttg tat gac gct ctt gat 480
Gly Asp His Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp
145 150 155 160

gtt gtt tta tac atg gac cca atg tgc ctg gat gcg ttc cca aaa tta 528
Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu
165 170 175

gtt tgt ttt aaa aaa cgt att gaa gct atc cca caa att gat aag tac 576
Val Cys Phe Lys Lys Arg Ile Glu Ala Ile Pro Gln Ile Asp Lys Tyr
180 185 190

ttg aaa tcc agc aag tat ata gca tgg cct ttg cag ggc tgg caa gcc 624
Leu Lys Ser Ser Lys Tyr Ile Ala Trp Pro Leu Gln Gly Trp Gln Ala
195 200 205

acg ttt ggt ggt ggc gac cat cct cca aaa tcg gat ctg gtt ccg cgt 672
Thr Phe Gly Gly Gly Asp His Pro Pro Lys Ser Asp Leu Val Pro Arg
210 215 220

gga tcc ccg gaa ttc aaa gac aac gtt gcg gac gtg gta gtg gtg ggc 720
Gly Ser Pro Glu Phe Lys Asp Asn Val Ala Asp Val Val Val Val Gly
225 230 235 240



gct ggc ttg agc ggt ttg gag acg gca cgc aaa gtc cag gcc gcc ggt 768
Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val Gln Ala Ala Gly
245 250 255

ctg tcc tgc ctc gtt ctt gag gcg atg gat cgt gta ggg gga aag act 816
Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val Gly Gly Lys Thr
260 265 270

ctg agc gta caa tcg ggt ccc ggc agg acg act atc aac gac ctc ggc 864
Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly
275 280 285

gct gcg tgg atc aat gac agc aac caa agc gaa gta tcc aga ttg ttt 912
Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val Ser Arg Leu Phe
290 295 300

gaa aga ttt cat ttg gag ggc gag ctc cag agg acg act gga aat tca 960
Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser
305 310 315 320

atc cat caa gca caa gac ggt aca acc act aca gct cct tat ggt gac 1008
Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp
325 330 335

tcc ttg ctg agc gag gag gtt gca agt gca ctt gcg gaa ctc ctc ccc 1056
Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala Glu Leu Leu Pro
340 345 350

gta tgg tct cag ctg atc gaa gag cat agc ctt caa gac ctc aag gcg 1104
Val Trp Ser Gln Leu Ile Glu Glu His Ser Leu Gln Asp Leu Lys Ala
355 360 365

agc cct cag gcg aag cgg ctc gac agt gtg agc ttc gcg cac tac tgt 1152
Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe Ala His Tyr Cys
370 375 380

gag aag gaa cta aac ttg cct gct gtt ctc ggc gta gca aac cag atc 1200
Glu Lys Glu Leu Asn Leu Pro Ala Val Leu Gly Val Ala Asn Gln Ile
385 390 395 400

aca cgc gct ctg ctc ggt gtg gaa gcc cac gag atc agc atg ctt ttt 1248
Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile Ser Met Leu Phe
405 410 415

ctc acc gac tac atc aag agt gcc acc ggt ctc agt aat att ttc tcg 1296
Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser
420 425 430

gac aag aaa gac ggc ggg cag tat atg cga tgc aaa aca ggt atg cag 1344
Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys Thr Gly Met Gln
435 440 445

tcg att tgc cat gcc atg tca aag gaa ctt gtt cca ggc tca gtg cac 1392
Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro Gly Ser Val His
450 455 460

ctc aac acc ccc gtc gct gaa att gag cag tcg gca tcc ggc tgt aca 1440
Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr
465 470 475 480

gta cga tcg gcc tcg ggc gcc gtg ttc cga agc aaa aag gtg gtg gtt 1488
Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys Lys Val Val Val
485 490 495

tcg tta ccg aca acc ttg tat ccc acc ttg aca ttt tca cca cct ctt 1536
Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu
500 505 510

ccc gcc gag aag caa gca ttg gcg gaa aat tct atc ctg ggc tac tat 1584
Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr
515 520 525

agc aag ata gtc ttc gta tgg gac aag ccg tgg tgg cgc gaa caa ggc 1632
Ser Lys Ile Val Phe Val Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly
530 535 540

ttc tcg ggc gtc ctc caa tcg agc tgt gac ccc atc tca ttt gcc aga 1680
Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile Ser Phe Ala Arg
545 550 555 560

gat acc agc atc gac gtc gat cga caa tgg tcc att acc tgt ttc atg 1728
Asp Thr Ser Ile Asp Val Asp Arg Gln Trp Ser Ile Thr Cys Phe Met
565 570 575

gtc gga gac ccg gga cgg aag tgg tcc caa cag tcc aag cag gta cga 1776
Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser Lys Gln Val Arg
580 585 590

caa aag tct gtc tgg gac caa ctc cgc gca gcc tac gag aac gcc ggg 1824
Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly
595 600 605

gcc caa gtc cca gag ccg gcc aac gtg ctc gaa atc gag tgg tcg aag 1872
Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile Glu Trp Ser Lys
610 615 620

cag cag tat ttc caa gga gct ccg agc gcc gtc tat ggg ctg aac gat 1920
Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp
625 630 635 640

ctc atc aca ctg ggt tcg gcg ctc aga acg ccg ttc aag agt gtt cat 1968
Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe Lys Ser Val His
645 650 655

ttc gtt gga acg gag acg tct tta gtt tgg aaa ggg tat atg gaa ggg 2016
Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly Tyr Met Glu Gly
660 665 670

gcc ata cga tcg ggt caa cga ggt gct gca gaa gtt gtg gct agc ctg 2064
Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val Val Ala Ser Leu
675 680 685

gtg cca gca gca tag 2079 
Val Pro Ala Ala 
690 
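
The feature coordinates given for SEQ ID NO: 18 can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration: the coordinate pairs are copied from the `<222>` fields and the `<223>` description, while the dictionary layout and function names are hypothetical.

```python
# 1-based, inclusive nucleotide coordinates taken from the SEQ ID NO: 18 feature fields.
FEATURES = {
    "CDS": (1, 2076),
    "gst fusion + polylinker": (1, 687),
    "K:trAPAO": (688, 2076),
    "extra lysine": (688, 690),
    "stop codon": (2077, 2079),
}


def span_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based inclusive span."""
    return end - start + 1


def codon_count(start: int, end: int) -> int:
    """Number of whole codons in a span; raises if the span is not a multiple of 3."""
    length = span_length(start, end)
    if length % 3 != 0:
        raise ValueError(f"span {start}-{end} is not a whole number of codons")
    return length // 3


def codon_index(nt_position: int, cds_start: int = 1) -> int:
    """1-based index of the codon containing a nucleotide position."""
    return (nt_position - cds_start) // 3 + 1


if __name__ == "__main__":
    for name, (start, end) in FEATURES.items():
        print(f"{name}: {span_length(start, end)} nt, {codon_count(start, end)} codons")
    print("extra lysine is codon", codon_index(688))
```

With these coordinates the CDS spans 692 codons, which agrees with the 692-residue length declared for SEQ ID NO: 19, and the extra lysine falls at codon 230, immediately after the 229-codon GST fusion and polylinker.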



<210> 19 
<211> 692 
<212> PRT 
<213> Unknown 

<400> 19 

Met Ser Pro Ile Leu Gly Tyr Trp Lys Ile Lys Gly Leu Val Gln Pro
1 5 10 15

Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His Leu
20 25 30

Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe Glu Leu
35 40 45

Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr Ile Asp Gly Asp Val Lys
50 55 60

Leu Thr Gln Ser Met Ala Ile Ile Arg Tyr Ile Ala Asp Lys His Asn
65 70 75 80

Met Leu Gly Gly Cys Pro Lys Glu Arg Ala Glu Ile Ser Met Leu Glu
85 90 95

Gly Ala Val Leu Asp Ile Arg Tyr Gly Val Ser Arg Ile Ala Tyr Ser
100 105 110

Lys Asp Phe Glu Thr Leu Lys Val Asp Phe Leu Ser Lys Leu Pro Glu
115 120 125

Met Leu Lys Met Phe Glu Asp Arg Leu Cys His Lys Thr Tyr Leu Asn
130 135 140

Gly Asp His Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp
145 150 155 160

Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu
165 170 175

Val Cys Phe Lys Lys Arg Ile Glu Ala Ile Pro Gln Ile Asp Lys Tyr
180 185 190

Leu Lys Ser Ser Lys Tyr Ile Ala Trp Pro Leu Gln Gly Trp Gln Ala
195 200 205

Thr Phe Gly Gly Gly Asp His Pro Pro Lys Ser Asp Leu Val Pro Arg
210 215 220

Gly Ser Pro Glu Phe Lys Asp Asn Val Ala Asp Val Val Val Val Gly
225 230 235 240

Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val Gln Ala Ala Gly
245 250 255

Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val Gly Gly Lys Thr
260 265 270

Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly
275 280 285

Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val Ser Arg Leu Phe
290 295 300

Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser
305 310 315 320

Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp
325 330 335

Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala Glu Leu Leu Pro
340 345 350

Val Trp Ser Gln Leu Ile Glu Glu His Ser Leu Gln Asp Leu Lys Ala
355 360 365

Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe Ala His Tyr Cys
370 375 380

Glu Lys Glu Leu Asn Leu Pro Ala Val Leu Gly Val Ala Asn Gln Ile
385 390 395 400

Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile Ser Met Leu Phe
405 410 415

Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser
420 425 430

Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys Thr Gly Met Gln
435 440 445

Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro Gly Ser Val His
450 455 460

Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr
465 470 475 480

Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys Lys Val Val Val
485 490 495

Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu
500 505 510

Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr
515 520 525

Ser Lys Ile Val Phe Val Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly
530 535 540

Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile Ser Phe Ala Arg
545 550 555 560

Asp Thr Ser Ile Asp Val Asp Arg Gln Trp Ser Ile Thr Cys Phe Met
565 570 575

Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser Lys Gln Val Arg
580 585 590

Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly
595 600 605

Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile Glu Trp Ser Lys
610 615 620

Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp
625 630 635 640

Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe Lys Ser Val His
645 650 655

Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly Tyr Met Glu Gly
660 665 670

Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val Val Ala Ser Leu
675 680 685

Val Pro Ala Ala
690

<210> 20 
<211> 1464 
<212> DNA 
<213> Unknown

<220>
<223> Nucleotide sequence of K:trAPAO translational
fusion with barley alpha amylase signal sequence,
for expression and secretion of the mature trAPAO
in maize. Nucleotides 1-72, barley alpha amylase
signal sequence; nucleotides 73-75, added lysine
residue; nucleotides 76-1464, trAPAO cDNA.

<221> sig_peptide
<222> (1)...(72)
<223> Barley alpha amylase signal sequence

<221> misc_feature
<222> (73)...(1464)
<223> K:trAPAO cDNA

<221> CDS
<222> (1)...(1461)

<221> misc_feature
<222> (73)...(75)
<223> Added lysine residue

<400> 20 

atg gcc aac aag cac ctg agc ctc tcc ctc ttc ctc gtg ctc ctc ggc 48
Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly
-20 -15 -10

ctc tcc gcc tcc ctc gcc agc ggc aaa gac aac gtt gcg gac gtg gta 96
Leu Ser Ala Ser Leu Ala Ser Gly Lys Asp Asn Val Ala Asp Val Val
-5 1 5

gtg gtg ggc gct ggc ttg agc ggt ttg gag acg gca cgc aaa gtc cag 144
Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val Gln
10 15 20

gcc gcc ggt ctg tcc tgc ctc gtt ctt gag gcg atg gat cgt gta ggg 192
Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val Gly
25 30 35 40

gga aag act ctg agc gta caa tcg ggt ccc ggc agg acg act atc aac 240
Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile Asn
45 50 55

gac ctc ggc gct gcg tgg atc aat gac agc aac caa agc gaa gta tcc 288
Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val Ser
60 65 70

aga ttg ttt gaa aga ttt cat ttg gag ggc gag ctc cag agg acg act 336
Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr Thr
75 80 85

gga aat tca atc cat caa gca caa gac ggt aca acc act aca gct cct 384
Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala Pro
90 95 100

tat ggt gac tcc ttg ctg agc gag gag gtt gca agt gca ctt gcg gaa 432
Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala Glu
105 110 115 120

ctc ctc ccc gta tgg tct cag ctg atc gaa gag cat agc ctt caa gac 480
Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu His Ser Leu Gln Asp
125 130 135

ctc aag gcg agc cct cag gcg aag cgg ctc gac agt gtg agc ttc gcg 528
Leu Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe Ala
140 145 150

cac tac tgt gag aag gaa cta aac ttg cct gct gtt ctc ggc gta gca 576
His Tyr Cys Glu Lys Glu Leu Asn Leu Pro Ala Val Leu Gly Val Ala
155 160 165

aac cag atc aca cgc gct ctg ctc ggt gtg gaa gcc cac gag atc agc 624
Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile Ser
170 175 180

atg ctt ttt ctc acc gac tac atc aag agt gcc acc ggt ctc agt aat 672
Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser Asn
185 190 195 200

att ttc tcg gac aag aaa gac ggc ggg cag tat atg cga tgc aaa aca 720
Ile Phe Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys Thr
205 210 215

ggt atg cag tcg att tgc cat gcc atg tca aag gaa ctt gtt cca ggc 768
Gly Met Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro Gly
220 225 230

tca gtg cac ctc aac acc ccc gtc gct gaa att gag cag tcg gca tcc 816
Ser Val His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala Ser
235 240 245



ggc tgt aca gta cga tcg gcc tcg ggc gcc gtg ttc cga agc aaa aag 864
Gly Cys Thr Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys Lys
250 255 260

gtg gtg gtt tcg tta ccg aca acc ttg tat ccc acc ttg aca ttt tca 912
Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe Ser
265 270 275 280

cca cct ctt ccc gcc gag aag caa gca ttg gcg gaa aat tct atc ctg 960
Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn Ser Ile Leu
285 290 295

ggc tac tat agc aag ata gtc ttc gta tgg gac aag ccg tgg tgg cgc 1008
Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp Lys Pro Trp Trp Arg
300 305 310

gaa caa ggc ttc tcg ggc gtc ctc caa tcg agc tgt gac ccc atc tca 1056
Glu Gln Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile Ser
315 320 325

ttt gcc aga gat acc agc atc gac gtc gat cga caa tgg tcc att acc 1104
Phe Ala Arg Asp Thr Ser Ile Asp Val Asp Arg Gln Trp Ser Ile Thr
330 335 340

tgt ttc atg gtc gga gac ccg gga cgg aag tgg tcc caa cag tcc aag 1152
Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser Lys
345 350 355 360

cag gta cga caa aag tct gtc tgg gac caa ctc cgc gca gcc tac gag 1200
Gln Val Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr Glu
365 370 375

aac gcc ggg gcc caa gtc cca gag ccg gcc aac gtg ctc gaa atc gag 1248
Asn Ala Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile Glu
380 385 390

tgg tcg aag cag cag tat ttc caa gga gct ccg agc gcc gtc tat ggg 1296
Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr Gly
395 400 405

ctg aac gat ctc atc aca ctg ggt tcg gcg ctc aga acg ccg ttc aag 1344
Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe Lys
410 415 420

agt gtt cat ttc gtt gga acg gag acg tct tta gtt tgg aaa ggg tat 1392
Ser Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly Tyr
425 430 435 440

atg gaa ggg gcc ata cga tcg ggt caa cga ggt gct gca gaa gtt gtg 1440
Met Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val Val
445 450 455

gct agc ctg gtg cca gca gca tag 1464
Ala Ser Leu Val Pro Ala Ala
460
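
The `<223>` description of SEQ ID NO: 20 divides the construct into three consecutive nucleotide segments. As a minimal sketch, the Python below checks that those segments tile the 1464-nt sequence without gaps or overlaps and derives the residue counts implied by the CDS. The segment boundaries are copied from the listing; the function and variable names are hypothetical.

```python
# 1-based, inclusive segments from the SEQ ID NO: 20 description.
SEGMENTS = [
    ("barley alpha amylase signal sequence", 1, 72),
    ("added lysine residue", 73, 75),
    ("trAPAO cDNA", 76, 1464),
]
TOTAL_NT = 1464          # <211> of SEQ ID NO: 20
CDS = (1, 1461)          # <222> of the CDS feature


def tiles_exactly(segments, total_nt: int) -> bool:
    """True if the segments cover 1..total_nt with no gaps and no overlaps."""
    expected_start = 1
    for _name, start, end in sorted(segments, key=lambda seg: seg[1]):
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == total_nt + 1


def residues(start: int, end: int) -> int:
    """Number of codons in a 1-based inclusive span (integer division)."""
    return (end - start + 1) // 3


if __name__ == "__main__":
    print("segments tile the sequence:", tiles_exactly(SEGMENTS, TOTAL_NT))
    print("total residues:", residues(*CDS))
    print("signal residues:", residues(1, 72))
    print("mature residues:", residues(73, CDS[1]))
```

Under these coordinates the CDS encodes 487 residues, matching the length declared for SEQ ID NO: 21, of which 24 belong to the signal sequence and 463 to the mature polypeptide that begins with the added lysine.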



<210> 21 
<211> 487 
<212> PRT 
<213> Unknown 



<220>
<221> SIGNAL
<222> (1)...(24)

<400> 21
Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly
-20 -15 -10

Leu Ser Ala Ser Leu Ala Ser Gly Lys Asp Asn Val Ala Asp Val Val
-5 1 5

Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val Gln
10 15 20

Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val Gly
25 30 35 40

Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile Asn
45 50 55

Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val Ser
60 65 70

Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr Thr
75 80 85

Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala Pro
90 95 100

Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala Glu
105 110 115 120

Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu His Ser Leu Gln Asp
125 130 135

Leu Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe Ala
140 145 150

His Tyr Cys Glu Lys Glu Leu Asn Leu Pro Ala Val Leu Gly Val Ala
155 160 165

Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile Ser
170 175 180

Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser Asn
185 190 195 200

Ile Phe Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys Thr
205 210 215

Gly Met Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro Gly
220 225 230

Ser Val His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala Ser
235 240 245

Gly Cys Thr Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys Lys
250 255 260

Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe Ser
265 270 275 280

Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn Ser Ile Leu
285 290 295

Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp Lys Pro Trp Trp Arg
300 305 310

Glu Gln Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile Ser
315 320 325

Phe Ala Arg Asp Thr Ser Ile Asp Val Asp Arg Gln Trp Ser Ile Thr
330 335 340

Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser Lys
345 350 355 360

Gln Val Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr Glu
365 370 375

Asn Ala Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile Glu
380 385 390

Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr Gly
395 400 405

Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe Lys
410 415 420

Ser Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly Tyr
425 430 435 440

Met Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val Val
445 450 455

Ala Ser Leu Val Pro Ala Ala
460

<210> 22 
<211> 1803 
<212> DNA 

<213> Exophiala spinifera 

<220>
<221> CDS
<222> (1)...(1800)

<400> 22
atg gca ctt gca ccg agc tac atc aat ccc cca aac gtc gcc tcc cca 48
Met Ala Leu Ala Pro Ser Tyr Ile Asn Pro Pro Asn Val Ala Ser Pro
1 5 10 15

gca ggg tat tct cac gtc ggc gta ggc cca gac gga ggg agg tat gtg 96
Ala Gly Tyr Ser His Val Gly Val Gly Pro Asp Gly Gly Arg Tyr Val
20 25 30

aca ata gct gga cag att gga caa gac gct tcg ggc gtg aca gac cct 144
Thr Ile Ala Gly Gln Ile Gly Gln Asp Ala Ser Gly Val Thr Asp Pro
35 40 45

gcc tac gag aaa cag gtt gcc caa gca ttc gcc aat ctg cga gct tgc 192
Ala Tyr Glu Lys Gln Val Ala Gln Ala Phe Ala Asn Leu Arg Ala Cys
50 55 60

ctt gct gca gtt gga gcc act tca aac gac gtc acc aag ctc aat tac 240
Leu Ala Ala Val Gly Ala Thr Ser Asn Asp Val Thr Lys Leu Asn Tyr
65 70 75 80

tac atc gtc gac tac gcc ccg agc aaa ctc acc gca att gga gat ggg 288
Tyr Ile Val Asp Tyr Ala Pro Ser Lys Leu Thr Ala Ile Gly Asp Gly
85 90 95

ctg aag gct acc ttt gcc ctt gac agg ctc cct cct tgc acg ctg gtg 336
Leu Lys Ala Thr Phe Ala Leu Asp Arg Leu Pro Pro Cys Thr Leu Val
100 105 110

cca gtg tcg gcc ttg tct tca cct gaa tac ctc ttt gag gtt gat gcc 384
Pro Val Ser Ala Leu Ser Ser Pro Glu Tyr Leu Phe Glu Val Asp Ala
115 120 125

acg gcg ctg gtg ccg gga cac acg acc cca gac aac gtt gcg gac gtg 432
Thr Ala Leu Val Pro Gly His Thr Thr Pro Asp Asn Val Ala Asp Val
130 135 140

gta gtg gtg ggc gct ggc ttg agc ggt ttg gag acg gca cgc aaa gtc 480
Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val
145 150 155 160

cag gcc gcc ggt ctg tcc tgc ctc gtt ctt gag gcg atg gat cgt gta 528
Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val
165 170 175

ggg gga aag act ctg agc gta caa tcg ggt ccc ggc agg acg act atc 576
Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile
180 185 190



tec ggc tgt aca gta cga teg gec teg ggc gec gtg ttc cga age aaa 
Ser Gly Cys Thr Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys 
385 390 395 400 

aag gtg gtg gtt teg tta ccg aca acc ttg tat ccc acc ttg aca ttt 
Lys Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe 
405 410 415 

tea cca cct ctt ccc gec gag aag caa gca ttg gcg gaa aat tct ate 
Ser Pro Pro Leu Pro Ala Glu Lys Gin Ala Leu Ala Glu Asn Ser lie 
420 425 430 



768 



816 



864 



aac gac etc ggc get gcg tgg ate aat gac age aac caa age gaa gta 624 
Asn Asp Leu Gly Ala Ala Trp lie Asn Asp Ser Asn Gin Ser Glu Val 
195 200 205 

tec aga ttg ttt gaa aga ttt cat ttg gag ggc gag etc cag agg acg 672 
Ser Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gin Arg Thr 
210 215 220 

act gga aat tea ate cat caa gca caa gac ggt aca acc act aca get 72 0 

Thr Gly Asn Ser lie His Gin Ala Gin Asp Gly Thr Thr Thr Thr Ala 
225 230 235 240 

cct tat ggt gac tec ttg ctg age gag gag gtt gca agt gca ctt gcg 
Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala 
245 250 255 

gaa etc etc ccc gta tgg tct cag ctg ate gaa gag cat age ctt caa 
Glu Leu Leu Pro Val Trp Ser Gin Leu lie Glu Glu His Ser Leu Gin 
260 265 270 

gac etc aag gcg age cct cag gcg aag egg etc gac agt gtg age ttc 
Asp Leu Lys Ala Ser Pro Gin Ala Lys Arg Leu Asp Ser Val Ser Phe 
275 280 285 

gcg cac tac tgt gag aag gaa eta aac ttg cct get gtt etc ggc gta 912 
Ala His Tyr Cys Glu Lys Glu Leu Asn Leu Pro Ala Val Leu Gly Val 
290 295 300 

gca aac cag ate aca cgc get ctg etc ggt gtg gaa gee cac gag ate 960 
Ala Asn Gin lie Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu lie 
305 310 315 320 

age atg ctt ttt etc acc gac tac ate aag agt gec acc ggt etc agt 1008 
Ser Met Leu Phe Leu Thr Asp Tyr lie Lys Ser Ala Thr Gly Leu Ser 
325 330 335 

aat att ttc teg gac aag aaa gac ggc ggg cag tat atg cga tgc aaa 1056 
Asn lie Phe Ser Asp Lys Lys Asp Gly Gly Gin Tyr Met Arg Cys Lys 
340 345 350 

aca ggt atg cag teg att tgc cat gec atg tea aag gaa ctt gtt cca 1104 
Thr Gly Met Gin Ser lie Cys His Ala Met Ser Lys Glu Leu Val Pro 
355 360 365 

ggc tea gtg cac etc aac acc ccc gtc get gaa att gag cag teg gca 1152 
Gly Ser Val His Leu Asn Thr Pro Val Ala Glu lie Glu Gin Ser Ala 
370 375 380 



1200 



1248 



1296 



27 



ctg ggc tac tat age aag ata gtc ttc gta tgg gac aag ccg tgg tgg 1344 
Leu Gly Tyr Tyr Ser Lys lie Val Phe Val Trp Asp Lys Pro Trp Trp 
435 440 445 

cgc gaa caa ggc ttc teg ggc gtc etc caa teg age tgt gac ccc ate 13 92 

Arg Glu Gin Gly Phe Ser Gly Val Leu Gin Ser Ser Cys Asp Pro lie 
450 455 460 

tea ttt gee aga gat ace age ate gac gtc gat cga caa tgg tec att 
Ser Phe Ala Arg Asp Thr Ser lie Asp Val Asp Arg Gin Trp Ser He 
465 470 475 480 



580 585 590 



gtg get age ctg gtg cca gca gca tag 
Val Ala Ser Leu Val Pro Ala Ala 
595 600 



<210> 23 
<211> 600 
<212> PRT 

<213> Exophiala spinifera 



<400> 23 

Met Ala Leu Ala Pro Ser Tyr He Asn Pro Pro Asn Val Ala Ser Pro 

15 10 15 

Ala Gly Tyr Ser His Val Gly Val Gly Pro Asp Gly Gly Arg Tyr Val 

20 25 30 

Thr He Ala Gly Gin He Gly Gin Asp Ala Ser Gly Val Thr Asp Pro 

35 40 45 

Ala Tyr Glu Lys Gin Val Ala Gin Ala Phe Ala Asn Leu Arg Ala Cys 

50 55 60 

Leu Ala Ala Val Gly Ala Thr Ser Asn Asp Val Thr Lys Leu Asn Tyr 
65 70 75 80 

Tyr He Val Asp Tyr Ala Pro Ser Lys Leu Thr Ala He Gly Asp Gly 

28 



1440 



acc tgt ttc atg gtc gga gac ccg gga egg aag tgg tec caa cag tec 1488 
Thr Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gin Gin Ser 
485 490 495 

aag cag gta cga caa aag tct gtc tgg gac caa etc cgc gca gee tac 1536 
Lys Gin Val Arg Gin Lys Ser Val Trp Asp Gin Leu Arg Ala Ala Tyr 
500 505 510 

gag aac gee ggg gee caa gtc cca gag ccg gee aac gtg etc gaa ate 1584 
Glu Asn Ala Gly Ala Gin Val Pro Glu Pro Ala Asn Val Leu Glu He 
515 520 525 

gag tgg teg aag cag cag tat ttc caa gga get ccg age gec gtc tat 1632 
Glu Trp Ser Lys Gin Gin Tyr Phe Gin Gly Ala Pro Ser Ala Val Tyr 
530 535 540 

ggg c tg aac gat etc ate aca ctg ggt teg gcg etc aga acg ccg ttc 1680 
Gly Leu Asn Asp Leu He Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe 
545 550 555 560 

aag agt gtt cat ttc gtt gga acg gag acg tct tta gtt tgg aaa ggg 172 8 

Lys Ser Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly 
565 570 575 

tat atg gaa ggg gee ata cga teg ggt caa cga ggt get gca gaa gtt 1776 
Tyr Met Glu Gly Ala He Arg Ser Gly Gin Arg Gly Ala Ala Glu Val 

- « - rnn 



1803 



Leu Lys Ala Thr 
100 

Pro Val Ser Ala 
115 

Thr Ala Leu Val 
130 

Val Val Val Gly 
145 

Gin Ala Ala Gly 

Gly Gly Lys Thr 
180 

Asn Asp Leu Gly 
195 

Ser Arg Leu Phe 
210 

Thr Gly Asn Ser 
225 

Pro Tyr Gly Asp 

Glu Leu Leu Pro 
260 

Asp Leu Lys Ala 
275 

Ala His Tyr Cys 
290 

Ala Asn Gin lie 
305 

Ser Met Leu Phe 

Asn lie Phe Ser 
340 

Thr Gly Met Gin 
355 

Gly Ser Val His 
370 

Ser Gly Cys Thr 
385 

Lys Val Val Val 

Ser Pro Pro Leu 
420 

Leu Gly Tyr Tyr 
435 

Arg Glu Gin Gly 
450 

Ser Phe Ala Arg 
465 

Thr Cys Phe Met 

Lys Gin Val Arg 
500 

Glu Asn Ala Gly 
515 

Glu Trp Ser Lys 
530 

Gly Leu Asn Asp 
545 

Lys Ser Val His 
Tyr Met Glu Gly 



85 

Phe Ala Leu Asp 

Leu Ser Ser Pro 
12 0 

Pro Gly His Thr 
135 

Ala Gly Leu Ser 
150 

Leu Ser Cys Leu 
165 

Leu Ser Val Gin 

Ala Ala Trp lie 
200 

Glu Arg Phe His 
215 

He His Gin Ala 

230 

Ser Leu Leu Ser 
245 

Val Trp Ser Gin 

Ser Pro Gin Ala 
280 

Glu Lys Glu Leu 
295 

Thr Arg Ala Leu 
310 

Leu Thr Asp Tyr 
325 

Asp Lys Lys Asp 

Ser He Cys His 
360 

Leu Asn Thr Pro 
375 

Val Arg Ser Ala 
390 

Ser Leu Pro Thr 
405 

Pro Ala Glu Lys 

Ser Lys He Val 
440 

Phe Ser Gly Val 
455 

Asp Thr Ser He 
470 

Val Gly Asp Pro 
485 

Gin Lys Ser Val 

Ala Gin Val Pro 
520 

Gin Gin Tyr Phe 
535 

Leu He Thr Leu 
550 

Phe Val Gly Thr 
565 

Ala He Arg Ser 



90 

Arg Leu Pro Pro 
105 

Glu Tyr Leu Phe 

Thr Pro Asp Asn 
140 

Gly Leu Glu Thr 
155 

Val Leu Glu Ala 
170 

Ser Gly Pro Gly 
185 

Asn Asp Ser Asn 

Leu Glu Gly Glu 
220 

Gin Asp Gly Thr 
235 

Glu Glu Val Ala 
250 

Leu He Glu Glu 
265 

Lys Arg Leu Asp 

Asn Leu Pro Ala 
300 

Leu Gly Val Glu 
315 

lie Lys Ser Ala 
330 

Gly Gly Gin Tyr 
345 

Ala Met Ser Lys 

Val Ala Glu He 
380 

Ser Gly Ala Val 
395 

Thr Leu Tyr Pro 
410 

Gin Ala Leu Ala 
425 

Phe Val Trp Asp 

Leu Gin Ser Ser 
460 

Asp Val Asp Arg 
475 

Gly Arg Lys Trp 
490 

Trp Asp Gin Leu 
505 

Glu Pro Ala Asn 

Gin Gly Ala Pro 
540 

Gly Ser Ala Leu 
555 

Glu Thr Ser Leu 
570 

Gly Gin Arg Gly 

29 



95 

Cys Thr Leu Val 
110 

Glu Val Asp Ala 
125 

Val Ala Asp Val 

Ala Arg Lys Val 
160 

Met Asp Arg Val 
175 

Arg Thr Thr He 
190 

Gin Ser Glu Val 
205 

Leu Gin Arg Thr 

Thr Thr Thr Ala 
240 

Ser Ala Leu Ala 
255 

His Ser Leu Gin 
270 

Ser Val Ser Phe 
285 

Val Leu Gly Val 

Ala His Glu He 
320 

Thr Gly Leu Ser 
335 

Met Arg Cys Lys 
350 

Glu Leu Val Pro 
365 

Glu Gin Ser Ala 

Phe Arg Ser Lys 
400 

Thr Leu Thr Phe 
415 

Glu Asn Ser He 
430 

Lys Pro Trp Trp 
445 

Cys Asp Pro He 

Gin Trp Ser He 
480 

Ser Gin Gin Ser 
495 

Arg Ala Ala Tyr 
510 

Val Leu Glu He 
525 

Ser Ala Val Tyr 

Arg Thr Pro Phe 
560 

Val Trp Lys Gly 
575 

Ala Ala Glu Val 



580 585 590 

Val Ala Ser Leu Val Pro Ala Ala 
595 600 



<210> 24 
<211> 3003 
<212> DNA 
<213> Unknown 

<220> 

<223> Sequence is barley alpha amylase signal sequence:
esp1 mat: an artificial spacer sequence and
K: trAPAO

<221> sig_peptide
<222> (1)...(72)
<223> Barley alpha amylase signal sequence

<221> misc_feature
<222> (73)...(1575)
<223> esp1 mat

<221> misc_feature
<222> (1576)...(1611)
<223> spacer sequence

<221> misc_feature
<222> (1612)...(3000)
<223> K: trAPAO

<221> CDS
<222> (1)...(3000)

<221> misc_feature
<222> (1612)...(1614)
<223> Extra lysine

<400> 24 

atg gcc aac aag cac ctg agc ctc tcc ctc ttc ctc gtg ctc ctc ggc 48
Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly
-20 -15 -10

ctc tcc gcc tcc ctc gcc agc ggc gct cct act gtc aag att gat gct 96
Leu Ser Ala Ser Leu Ala Ser Gly Ala Pro Thr Val Lys Ile Asp Ala
-5 1 5

ggg atg gtg gtc ggc acg act act act gtc ccc ggc acc act gcg acc 144
Gly Met Val Val Gly Thr Thr Thr Thr Val Pro Gly Thr Thr Ala Thr
10 15 20

gtc agc gag ttc ttg ggc gtt cct ttt gcc gcc tct ccg aca cga ttt 192
Val Ser Glu Phe Leu Gly Val Pro Phe Ala Ala Ser Pro Thr Arg Phe
25 30 35 40

gcg cct cct act cgt ccc gtg cct tgg tca acg cct ttg caa gcc act 240
Ala Pro Pro Thr Arg Pro Val Pro Trp Ser Thr Pro Leu Gln Ala Thr
45 50 55

gca tat ggt cca gca tgc cct caa caa ttc aat tac ccc gaa gaa ctc 288
Ala Tyr Gly Pro Ala Cys Pro Gln Gln Phe Asn Tyr Pro Glu Glu Leu
60 65 70

cgt gag att acg atg gcc tgg ttc aat aca ccg ccc ccg tca gct ggt 336
Arg Glu Ile Thr Met Ala Trp Phe Asn Thr Pro Pro Pro Ser Ala Gly
75 80 85

gaa agt gag gac tgc ctg aac ctc aac atc tac gtc cca gga act gag 384
Glu Ser Glu Asp Cys Leu Asn Leu Asn Ile Tyr Val Pro Gly Thr Glu
90 95 100

aac aca aac aaa gcc gtc atg gtt tgg ata tac ggt gga gcg ctg gaa 432
Asn Thr Asn Lys Ala Val Met Val Trp Ile Tyr Gly Gly Ala Leu Glu
105 110 115 120

tat ggt tgg aat tca ttc cac ctt tac gac ggg gct agt ttc gca gcc 480
Tyr Gly Trp Asn Ser Phe His Leu Tyr Asp Gly Ala Ser Phe Ala Ala
125 130 135

aat cag gat gtc atc gcc gtg acc atc aac tac aga acg aac att ctg 528
Asn Gln Asp Val Ile Ala Val Thr Ile Asn Tyr Arg Thr Asn Ile Leu
140 145 150

ggg ttc cct gct gcc cct cag ctt cca ata aca cag cga aat ctg ggg 576
Gly Phe Pro Ala Ala Pro Gln Leu Pro Ile Thr Gln Arg Asn Leu Gly
155 160 165

ttc cta gac caa agg ttt gct ttg gat tgg gta cag cgg aac atc gca 624
Phe Leu Asp Gln Arg Phe Ala Leu Asp Trp Val Gln Arg Asn Ile Ala
170 175 180

gcc ttt ggc ggt gat cct cga aag gtc aca ata ttt ggg cag agt gcg 672
Ala Phe Gly Gly Asp Pro Arg Lys Val Thr Ile Phe Gly Gln Ser Ala
185 190 195 200

ggg ggc aga agt gtg gac gtc ctc ttg acg tct atg cca cac aac cca 720
Gly Gly Arg Ser Val Asp Val Leu Leu Thr Ser Met Pro His Asn Pro
205 210 215

ccc ttc cga gca gca atc atg gag tcc ggt gtg gct aac tac aac ttc 768
Pro Phe Arg Ala Ala Ile Met Glu Ser Gly Val Ala Asn Tyr Asn Phe
220 225 230

ccc aag gga gat ttg tcc gaa cct tgg aac acc act gtt caa gct ctc 816
Pro Lys Gly Asp Leu Ser Glu Pro Trp Asn Thr Thr Val Gln Ala Leu
235 240 245

aac tgt acc acc agt atc gac atc ttg agt tgt atg aga aga gtc gat 864
Asn Cys Thr Thr Ser Ile Asp Ile Leu Ser Cys Met Arg Arg Val Asp
250 255 260

ctc gcc act ctg atg aac acg atc gag caa ctc gga ctt ggg ttt gag 912
Leu Ala Thr Leu Met Asn Thr Ile Glu Gln Leu Gly Leu Gly Phe Glu
265 270 275 280

tac acg ttg gac aac gta acg gct gtg tac cgt tct gaa acg gct cgc 960
Tyr Thr Leu Asp Asn Val Thr Ala Val Tyr Arg Ser Glu Thr Ala Arg
285 290 295

acg act ggt gac att gct cgt gta cct gtt ctc gtc ggg acg gtg gcc 1008
Thr Thr Gly Asp Ile Ala Arg Val Pro Val Leu Val Gly Thr Val Ala
300 305 310

aac gac gga ctt ctc ttt gtc ctc ggg gag aat gac acc caa gca tat 1056
Asn Asp Gly Leu Leu Phe Val Leu Gly Glu Asn Asp Thr Gln Ala Tyr
315 320 325

ctc gag gag gca atc ccg aat cag ccc gac ctt tac cag act ctc ctt 1104
Leu Glu Glu Ala Ile Pro Asn Gln Pro Asp Leu Tyr Gln Thr Leu Leu
330 335 340

gga gca tat ccc att gga tcc cca ggg atc gga tcg cct caa gat cag 1152
Gly Ala Tyr Pro Ile Gly Ser Pro Gly Ile Gly Ser Pro Gln Asp Gln
345 350 355 360

att gcc gcc att gag acc gag gta aga ttc cag tgt cct tct gcc atc 1200
Ile Ala Ala Ile Glu Thr Glu Val Arg Phe Gln Cys Pro Ser Ala Ile
365 370 375

gtg gct cag gac tcc cgg aat cgg ggt atc cct tct tgg cgc tac tac 1248
Val Ala Gln Asp Ser Arg Asn Arg Gly Ile Pro Ser Trp Arg Tyr Tyr
380 385 390

tac aat gcg acc ttt gag aat ctg gag ctt ttc cct ggg tcc gaa gtg 1296
Tyr Asn Ala Thr Phe Glu Asn Leu Glu Leu Phe Pro Gly Ser Glu Val
395 400 405

tac cac agc tct gaa gtc ggg atg gtg ttt ggc acg tat cct gtc gca 1344
Tyr His Ser Ser Glu Val Gly Met Val Phe Gly Thr Tyr Pro Val Ala
410 415 420

agt gcg acc gcc ttg gag gcc cag acg agc aaa tac atg cag ggt gcc 1392
Ser Ala Thr Ala Leu Glu Ala Gln Thr Ser Lys Tyr Met Gln Gly Ala
425 430 435 440

tgg gcg gcc ttt gcc aaa aac ccc atg aat ggg cct ggg tgg aaa caa 1440
Trp Ala Ala Phe Ala Lys Asn Pro Met Asn Gly Pro Gly Trp Lys Gln
445 450 455

gtg ccg aat gtc gcg gcg ctt ggc tca cca ggc aaa gcc atc cag gtt 1488
Val Pro Asn Val Ala Ala Leu Gly Ser Pro Gly Lys Ala Ile Gln Val
460 465 470

gac gtc tct cca gcg aca ata gac caa cga tgt gcc ttg tac acg cgt 1536
Asp Val Ser Pro Ala Thr Ile Asp Gln Arg Cys Ala Leu Tyr Thr Arg
475 480 485

tat tat act gag ttg ggc aca atc gcg ccg agg aca ttt ggc gga ggc 1584
Tyr Tyr Thr Glu Leu Gly Thr Ile Ala Pro Arg Thr Phe Gly Gly Gly
490 495 500

agc ggc gga ggc agc ggc gga ggc agc aaa gac aac gtt gcg gac gtg 1632
Ser Gly Gly Gly Ser Gly Gly Gly Ser Lys Asp Asn Val Ala Asp Val
505 510 515 520

gta gtg gtg ggc gct ggc ttg agc ggt ttg gag acg gca cgc aaa gtc 1680
Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val
525 530 535

cag gcc gcc ggt ctg tcc tgc ctc gtt ctt gag gcg atg gat cgt gta 1728
Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val
540 545 550

ggg gga aag act ctg agc gta caa tcg ggt ccc ggc agg acg act atc 1776
Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile
555 560 565

aac gac ctc ggc gct gcg tgg atc aat gac agc aac caa agc gaa gta 1824
Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val
570 575 580



tcc aga ttg ttt gaa aga ttt cat ttg gag ggc gag ctc cag agg acg 1872
Ser Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr
585 590 595 600

act gga aat tca atc cat caa gca caa gac ggt aca acc act aca gct 1920
Thr Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala
605 610 615

cct tat ggt gac tcc ttg ctg agc gag gag gtt gca agt gca ctt gcg 1968
Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala
620 625 630

gaa ctc ctc ccc gta tgg tct cag ctg atc gaa gag cat agc ctt caa 2016
Glu Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu His Ser Leu Gln
635 640 645

gac ctc aag gcg agc cct cag gcg aag cgg ctc gac agt gtg agc ttc 2064
Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe
650 655 660

gcg cac tac tgt gag aag gaa cta aac ttg cct gct gtt ctc ggc gta 2112
Ala His Tyr Cys Glu Lys Glu Leu Asn Leu Pro Ala Val Leu Gly Val
665 670 675 680

gca aac cag atc aca cgc gct ctg ctc ggt gtg gaa gcc cac gag atc 2160
Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile
685 690 695

agc atg ctt ttt ctc acc gac tac atc aag agt gcc acc ggt ctc agt 2208
Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser
700 705 710

aat att ttc tcg gac aag aaa gac ggc ggg cag tat atg cga tgc aaa 2256
Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys
715 720 725

aca ggt atg cag tcg att tgc cat gcc atg tca aag gaa ctt gtt cca 2304
Thr Gly Met Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro
730 735 740

ggc tca gtg cac ctc aac acc ccc gtc gct gaa att gag cag tcg gca 2352
Gly Ser Val His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala
745 750 755 760

tcc ggc tgt aca gta cga tcg gcc tcg ggc gcc gtg ttc cga agc aaa 2400
Ser Gly Cys Thr Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys
765 770 775

aag gtg gtg gtt tcg tta ccg aca acc ttg tat ccc acc ttg aca ttt 2448
Lys Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe
780 785 790

tca cca cct ctt ccc gcc gag aag caa gca ttg gcg gaa aat tct atc 2496
Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn Ser Ile
795 800 805

ctg ggc tac tat agc aag ata gtc ttc gta tgg gac aag ccg tgg tgg 2544
Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp Lys Pro Trp Trp
810 815 820

cgc gaa caa ggc ttc tcg ggc gtc ctc caa tcg agc tgt gac ccc atc 2592
Arg Glu Gln Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile
825 830 835 840

tca ttt gcc aga gat acc agc atc gac gtc gat cga caa tgg tcc att 2640
Ser Phe Ala Arg Asp Thr Ser Ile Asp Val Asp Arg Gln Trp Ser Ile
845 850 855

acc tgt ttc atg gtc gga gac ccg gga cgg aag tgg tcc caa cag tcc 2688
Thr Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser
860 865 870

aag cag gta cga caa aag tct gtc tgg gac caa ctc cgc gca gcc tac 2736
Lys Gln Val Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr
875 880 885

gag aac gcc ggg gcc caa gtc cca gag ccg gcc aac gtg ctc gaa atc 2784
Glu Asn Ala Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile
890 895 900

gag tgg tcg aag cag cag tat ttc caa gga gct ccg agc gcc gtc tat 2832
Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr
905 910 915 920

ggg ctg aac gat ctc atc aca ctg ggt tcg gcg ctc aga acg ccg ttc 2880
Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe
925 930 935

aag agt gtt cat ttc gtt gga acg gag acg tct tta gtt tgg aaa ggg 2928
Lys Ser Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly
940 945 950

tat atg gaa ggg gcc ata cga tcg ggt caa cga ggt gct gca gaa gtt 2976
Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val
955 960 965

gtg gct agc ctg gtg cca gca gca tag 3003
Val Ala Ser Leu Val Pro Ala Ala
970 975



<210> 25 
<211> 1000 
<212> PRT 
<213> Unknown 

<220> 

<221> SIGNAL 
<222> (1)...(24)

<400> 25 
Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly
-20 -15 -10

Leu Ser Ala Ser Leu Ala Ser Gly Ala Pro Thr Val Lys Ile Asp Ala
-5 1 5

Gly Met Val Val Gly Thr Thr Thr Thr Val Pro Gly Thr Thr Ala Thr
10 15 20

Val Ser Glu Phe Leu Gly Val Pro Phe Ala Ala Ser Pro Thr Arg Phe
25 30 35 40

Ala Pro Pro Thr Arg Pro Val Pro Trp Ser Thr Pro Leu Gln Ala Thr
45 50 55

Ala Tyr Gly Pro Ala Cys Pro Gln Gln Phe Asn Tyr Pro Glu Glu Leu
60 65 70

Arg Glu Ile Thr Met Ala Trp Phe Asn Thr Pro Pro Pro Ser Ala Gly
75 80 85

Glu Ser Glu Asp Cys Leu Asn Leu Asn Ile Tyr Val Pro Gly Thr Glu
90 95 100

Asn Thr Asn Lys Ala Val Met Val Trp Ile Tyr Gly Gly Ala Leu Glu
105 110 115 120

Tyr Gly Trp Asn Ser Phe His Leu Tyr Asp Gly Ala Ser Phe Ala Ala
125 130 135

Asn Gln Asp Val Ile Ala Val Thr Ile Asn Tyr Arg Thr Asn Ile Leu
140 145 150

Gly Phe Pro Ala Ala Pro Gln Leu Pro Ile Thr Gln Arg Asn Leu Gly
155 160 165

Phe Leu Asp Gln Arg Phe Ala Leu Asp Trp Val Gln Arg Asn Ile Ala
170 175 180

Ala Phe Gly Gly Asp Pro Arg Lys Val Thr Ile Phe Gly Gln Ser Ala
185 190 195 200

Gly Gly Arg Ser Val Asp Val Leu Leu Thr Ser Met Pro His Asn Pro
205 210 215

Pro Phe Arg Ala Ala Ile Met Glu Ser Gly Val Ala Asn Tyr Asn Phe
220 225 230

Pro Lys Gly Asp Leu Ser Glu Pro Trp Asn Thr Thr Val Gln Ala Leu
235 240 245

Asn Cys Thr Thr Ser Ile Asp Ile Leu Ser Cys Met Arg Arg Val Asp
250 255 260

Leu Ala Thr Leu Met Asn Thr Ile Glu Gln Leu Gly Leu Gly Phe Glu
265 270 275 280

Tyr Thr Leu Asp Asn Val Thr Ala Val Tyr Arg Ser Glu Thr Ala Arg
285 290 295

Thr Thr Gly Asp Ile Ala Arg Val Pro Val Leu Val Gly Thr Val Ala
300 305 310

Asn Asp Gly Leu Leu Phe Val Leu Gly Glu Asn Asp Thr Gln Ala Tyr
315 320 325

Leu Glu Glu Ala Ile Pro Asn Gln Pro Asp Leu Tyr Gln Thr Leu Leu
330 335 340

Gly Ala Tyr Pro Ile Gly Ser Pro Gly Ile Gly Ser Pro Gln Asp Gln
345 350 355 360

Ile Ala Ala Ile Glu Thr Glu Val Arg Phe Gln Cys Pro Ser Ala Ile
365 370 375

Val Ala Gln Asp Ser Arg Asn Arg Gly Ile Pro Ser Trp Arg Tyr Tyr
380 385 390

Tyr Asn Ala Thr Phe Glu Asn Leu Glu Leu Phe Pro Gly Ser Glu Val
395 400 405

Tyr His Ser Ser Glu Val Gly Met Val Phe Gly Thr Tyr Pro Val Ala
410 415 420

Ser Ala Thr Ala Leu Glu Ala Gln Thr Ser Lys Tyr Met Gln Gly Ala
425 430 435 440

Trp Ala Ala Phe Ala Lys Asn Pro Met Asn Gly Pro Gly Trp Lys Gln
445 450 455

Val Pro Asn Val Ala Ala Leu Gly Ser Pro Gly Lys Ala Ile Gln Val
460 465 470

Asp Val Ser Pro Ala Thr Ile Asp Gln Arg Cys Ala Leu Tyr Thr Arg
475 480 485

Tyr Tyr Thr Glu Leu Gly Thr Ile Ala Pro Arg Thr Phe Gly Gly Gly
490 495 500

Ser Gly Gly Gly Ser Gly Gly Gly Ser Lys Asp Asn Val Ala Asp Val
505 510 515 520

Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val
525 530 535

Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val
540 545 550

Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile
555 560 565

Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val
570 575 580

Ser Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr
585 590 595 600

Thr Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala
605 610 615

Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala
620 625 630

Glu Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu His Ser Leu Gln
635 640 645

Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe
650 655 660

Ala His Tyr Cys Glu Lys Glu Leu Asn Leu Pro Ala Val Leu Gly Val
665 670 675 680

Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile
685 690 695

Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser
700 705 710

Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys
715 720 725

Thr Gly Met Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro
730 735 740

Gly Ser Val His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala
745 750 755 760

Ser Gly Cys Thr Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys
765 770 775

Lys Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe
780 785 790

Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn Ser Ile
795 800 805

Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp Lys Pro Trp Trp
810 815 820

Arg Glu Gln Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile
825 830 835 840

Ser Phe Ala Arg Asp Thr Ser Ile Asp Val Asp Arg Gln Trp Ser Ile
845 850 855

Thr Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser
860 865 870

Lys Gln Val Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr
875 880 885

Glu Asn Ala Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile
890 895 900

Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr
905 910 915 920

Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe
925 930 935

Lys Ser Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly
940 945 950

Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val
955 960 965

Val Ala Ser Leu Val Pro Ala Ala
970 975



<210> 26 
<211> 2976 
<212> DNA 
<213> Unknown 
<220>
<223> Barley alpha amylase signal sequence: BEST1
mature: artificial spacer: and K: trAPAO. For
plant expression.

<221> sig_peptide
<222> (1)...(72)
<223> Barley alpha amylase signal sequence

<221> mat_peptide
<222> (73)...(1545)
<223> BEST1 mature

<221> misc_feature
<222> (1546)...(1584)
<223> spacer sequence

<221> misc_feature
<222> (1585)...(2973)
<223> K: trAPAO

<221> CDS
<222> (1)...(2973)

<221> misc_feature
<222> (1585)...(1587)
<223> Extra lysine

<400> 26 

atg gcc aac aag cac ctg age etc tec etc ttc etc gtg etc etc ggc 48 
Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly 
-20 -15 -10 

etc tec gcc tec etc gcc age ggc acg gat ttt ccg gtc cgc agg acc 96 
Leu Ser Ala Ser Leu Ala Ser Gly Thr Asp Phe Pro Val Arg Arg Thr 
-5 15 

gat ctg ggc cag gtt cag gga ctg gcc ggg gac gtg atg age ttt cgc 144 
Asp Leu Gly Gin Val Gin Gly Leu Ala Gly Asp Val Met Ser Phe Arg 
10 15 20 

gga ata ccc tat gca gcg ccg ccg gtg ggc ggg ctg cgt tgg aag ccg 192 
Gly lie Pro Tyr Ala Ala Pro Pro Val Gly Gly Leu Arg Trp Lys Pro 
25 30 35 40 

ccc caa cac gcc egg ccc tgg gcg ggc gtt cgc ccc gcc acc caa ttt 240 
Pro Gin His Ala Arg Pro Trp Ala Gly Val Arg Pro Ala Thr Gin Phe 
45 50 55 

ggc tec gac tgc ttc ggc gcg gcc tat ctt cgc aaa ggc age etc gcc 2 88 

Gly Ser Asp Cys Phe Gly Ala Ala Tyr Leu Arg Lys Gly Ser Leu Ala 
60 65 70 

ccc ggc gtg age gag gac tgt ctt tac etc aac gta tgg gcg ccg tea 336 
Pro Gly Val Ser Glu Asp Cys Leu Tyr Leu Asn Val Trp Ala Pro Ser 

75 80 85 

ggc get aaa ccc ggc cag tac ccc gtc atg gtc tgg gtc tac ggc ggc 384 
Gly Ala Lys Pro Gly Gin Tyr Pro Val Met Val Trp Val Tyr Gly Gly 
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90 95 100 

ggc ttc gcc ggc ggc acg gcc gcc atg ccc tac tac gac ggc gag gcg 432 
Gly Phe Ala Gly Gly Thr Ala Ala Met Pro Tyr Tyr Asp Gly Glu Ala 
105 110 115 120 

ctt gcg cga cag ggc gtc gtc gtg gtg acg ttt aac tat egg acg aac 480 
Leu Ala Arg Gin Gly Val Val Val Val Thr Phe Asn Tyr Arg Thr Asn 
125 130 135 

ate ctg ggc ttt ttc gcc cat cct ggt etc teg cgc gag age ccc acc 52 8 

He Leu Gly Phe Phe Ala His Pro Gly Leu Ser Arg Glu Ser Pro Thr 
140 145 150 

gga act teg ggc aac tae ggc eta etc gac att etc gcc get ctt egg 576 
Gly Thr Ser Gly Asn Tyr Gly Leu Leu Asp He Leu Ala Ala Leu Arg 
155 160 165 

tgg gtg cag age aac gcc cgc gcc ttc gga ggg gac ccc ggc cga gtg 624 
Trp Val Gin Ser Asn Ala Arg Ala Phe Gly Gly Asp Pro Gly Arg Val 
170 175 180 

acg gtc ttt ggt gaa teg gcc gga gcg age gcg ate gga ctt ctg etc 6 72 

Thr Val Phe Gly Glu Ser Ala Gly Ala Ser Ala He Gly Leu Leu Leu 
185 190 195 200 

acc teg ccg ctg age aag ggt etc ttc cgt ggc get ate etc gaa agt 72 0 

Thr Ser Pro Leu Ser Lys Gly Leu Phe Arg Gly Ala He Leu Glu Ser 
205 210 215 

cca ggg ctg acg cga ccg etc gcg acg etc gcc gac age gcc gcc teg 
Pro Gly Leu Thr Arg Pro Leu Ala Thr Leu Ala Asp Ser Ala Ala Ser 
220 225 230 

ggc gag cgc etc gac gcc gat ctt teg cga ctg cgc teg acc gac cca 
Gly Glu Arg Leu Asp Ala Asp Leu Ser Arg Leu Arg Ser Thr Asp Pro 
235 240 245 

gcc acc ctg atg gcg cgc gcc gac gcg gcc cgc ccg gca teg egg gac 
Ala Thr Leu Met Ala Arg Ala Asp Ala Ala Arg Pro Ala Ser Arg Asp 
250 255 260 

ctg cgc agg ccg cgt ccg acc gga ccg ate gtc gat ggc cat gtg ctg 912 
Leu Arg Arg Pro Arg Pro Thr Gly Pro He Val Asp Gly His Val Leu 
265 270 275 280 

ccg cag acc gac age gcg gcg ate gcg gcg ggg cag ctg gcg ccg gtt 960 
Pro Gin Thr Asp Ser Ala Ala He Ala Ala Gly Gin Leu Ala Pro Val 
285 290 295 

egg gtc ctg ate gga acc aat gcc gac gaa ggc cgc gcc ttc etc ggg 
Arg Val Leu He Gly Thr Asn Ala Asp Glu Gly Arg Ala Phe Leu Gly 
300 305 310 

cgc gcg ccg atg gag acg cca gcg gac tac caa gcc tat ctg gag gcg 
Arg Ala Pro Met Glu Thr Pro Ala Asp Tyr Gin Ala Tyr Leu Glu Ala 
315 320 325 

cag ttt ggc gac caa gcc gcc gcc gtg gcg gcg tgc tat ccc etc gac 1104 
Gin Phe Gly Asp Gin Ala Ala Ala Val Ala Ala Cys Tyr Pro Leu Asp 
330 335 340 



768 



816 



864 



1008 



1056 
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ggc egg gec acg ccc aag gaa atg gtc gcg cgc ate ttc ggc gac aat 1152 
Gly Arg Ala Thr Pro Lys Glu Met Val Ala Arg lie Phe Gly Asp Asn 
345 350 355 360 

cag ttc aat egg ggg gtc teg gee ttc teg gaa gcg ctt gtg cgc cag 
Gin Phe Asn Arg Gly Val Ser Ala Phe Ser Glu Ala Leu Val Arg Gin 
365 370 375 



1200 



1248 



1296 



1344 



ggc gcg ccc gtg tgg cgt tat cag ttc aac ggt aat acc gag ggt gga 
Gly Ala Pro Val Trp Arg Tyr Gin Phe Asn Gly Asn Thr Glu Gly Gly 
380 385 390 

aga gcg ccg get acc cac gga gec gaa att ccc tac gtt ttc ggg gtg 
Arg Ala Pro Ala Thr His Gly Ala Glu He Pro Tyr Val Phe Gly Val 
395 400 405 

ttc aag etc gac gag ttg ggt ctg ttc gat tgg ccg ccc gag ggg ccc 
Phe Lys Leu Asp Glu Leu Gly Leu Phe Asp Trp Pro Pro Glu Gly Pro 
410 415 420 

acg ccc gee gac cgt gcg ctg ggc caa ctg atg tec tec gec tgg gtc 13 92 

Thr Pro Ala Asp Arg Ala Leu Gly Gin Leu Met Ser Ser Ala Trp Val 
425 430 435 440 

egg ttc gec aag aat ggc gac ccc gee ggg gac gec ctt acc tgg cct 144 0 

Arg Phe Ala Lys Asn Gly Asp Pro Ala Gly Asp Ala Leu Thr Trp Pro 
445 450 455 

gec tat tct acg ggc aag teg acc atg aca ttc ggt ccc gag ggc cgc 
Ala Tyr Ser Thr Gly Lys Ser Thr Met Thr Phe Gly Pro Glu Gly Arg 
460 465 470 

gcg gcg gtg gtg teg ccc gga cct tec ate ccc cct tgc gcg gat ggc 
Ala Ala Val Val Ser Pro Gly Pro Ser He Pro Pro Cys Ala Asp Gly 
475 480 485 

gee aag gcg ggg ggc gga ggc age ggc gga ggc age ggc gga ggc age 
Ala Lys Ala Gly Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 
490 495 500 

aaa gac aac gtt gcg gac gtg gta gtg gtg ggc get ggc ttg age ggt 
Lys Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly 
505 510 515 520 

ttg gag acg gca cgc aaa gtc cag gec gee ggt ctg tec tgc etc gtt 
Leu Glu Thr Ala Arg Lys Val Gin Ala Ala Gly Leu Ser Cys Leu Val 
525 530 535 

ctt gag gcg atg gat cgt gta ggg gga aag act ctg age gta caa teg 
Leu Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gin Ser 
540 545 550 

ggt ccc ggc agg acg act ate aac gac etc ggc get gcg tgg ate aat 1776 
Gly Pro Gly Arg Thr Thr He Asn Asp Leu Gly Ala Ala Trp He Asn 
555 560 565 

gac age aac caa age gaa gta tec aga ttg ttt gaa aga ttt cat ttg 
Asp Ser Asn Gin Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu 
570 575 580 

gag ggc gag etc cag agg acg act gga aat tea ate cat caa gca caa 
Glu Gly Glu Leu Gin Arg Thr Thr Gly Asn Ser He His Gin Ala Gin 
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1488 



1536 



1584 



1632 



1680 



1728 



1824 



1872 



585 



590 



595 



600 



gac ggt aca acc act aca gct cct tat ggt gac tcc ttg ctg agc gag 1920
Asp Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu
605 610 615

gag gtt gca agt gca ctt gcg gaa ctc ctc ccc gta tgg tct cag ctg 1968
Glu Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu
620 625 630

atc gaa gag cat agc ctt caa gac ctc aag gcg agc cct cag gcg aag 2016
Ile Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys
635 640 645

cgg ctc gac agt gtg agc ttc gcg cac tac tgt gag aag gaa cta aac 2064
Arg Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn
650 655 660

ttg cct gct gtt ctc ggc gta gca aac cag atc aca cgc gct ctg ctc 2112
Leu Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu
665 670 675 680

ggt gtg gaa gcc cac gag atc agc atg ctt ttt ctc acc gac tac atc 2160
Gly Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile
685 690 695

aag agt gcc acc ggt ctc agt aat att ttc tcg gac aag aaa gac ggc 2208
Lys Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly
700 705 710

ggg cag tat atg cga tgc aaa aca ggt atg cag tcg att tgc cat gcc 2256
Gly Gln Tyr Met Arg Cys Lys Thr Gly Met Gln Ser Ile Cys His Ala
715 720 725

atg tca aag gaa ctt gtt cca ggc tca gtg cac ctc aac acc ccc gtc 2304
Met Ser Lys Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val
730 735 740

gct gaa att gag cag tcg gca tcc ggc tgt aca gta cga tcg gcc tcg 2352
Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser
745 750 755 760

ggc gcc gtg ttc cga agc aaa aag gtg gtg gtt tcg tta ccg aca acc 2400
Gly Ala Val Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr
765 770 775

ttg tat ccc acc ttg aca ttt tca cca cct ctt ccc gcc gag aag caa 2448
Leu Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln
780 785 790

gca ttg gcg gaa aat tct atc ctg ggc tac tat agc aag ata gtc ttc 2496
Ala Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe
795 800 805

gta tgg gac aag ccg tgg tgg cgc gaa caa ggc ttc tcg ggc gtc ctc 2544
Val Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu
810 815 820

caa tcg agc tgt gac ccc atc tca ttt gcc aga gat acc agc atc gac 2592
Gln Ser Ser Cys Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp
825 830 835 840

gtc gat cga caa tgg tcc att acc tgt ttc atg gtc gga gac ccg gga 2640
Val Asp Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly
845 850 855

cgg aag tgg tcc caa cag tcc aag cag gta cga caa aag tct gtc tgg 2688
Arg Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp
860 865 870

gac caa ctc cgc gca gcc tac gag aac gcc ggg gcc caa gtc cca gag 2736
Asp Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu
875 880 885

ccg gcc aac gtg ctc gaa atc gag tgg tcg aag cag cag tat ttc caa 2784
Pro Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln
890 895 900

gga gct ccg agc gcc gtc tat ggg ctg aac gat ctc atc aca ctg ggt 2832
Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly
905 910 915 920

tcg gcg ctc aga acg ccg ttc aag agt gtt cat ttc gtt gga acg gag 2880
Ser Ala Leu Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu
925 930 935

acg tct tta gtt tgg aaa ggg tat atg gaa ggg gcc ata cga tcg ggt 2928
Thr Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly
940 945 950

caa cga ggt gct gca gaa gtt gtg gct agc ctg gtg cca gca gca 2973
Gln Arg Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala
955 960 965

tag 2976 
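
The lengths in this part of the listing can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the filed listing: it assumes the standard genetic code, the helper names `translate` and `listing_label` are hypothetical, and it reads the 24-residue SIGNAL feature of SEQ ID NO: 27 as meaning that mature residues are numbered from 1 with no residue numbered zero.

```python
# Standard genetic code, codons ordered t, c, a, g at each position.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a spaced or unspaced DNA string; '*' marks a stop codon."""
    seq = "".join(dna.lower().split())
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def listing_label(index: int, signal_len: int) -> int:
    """Map a 1-based residue index to listing numbering (negative in the signal, no zero)."""
    offset = index - signal_len
    return offset if offset > 0 else offset - 1


# Final five codons plus the stop codon of the open reading frame above.
tail = translate("ctg gtg cca gca gca tag")

# 2976 nucleotides give 992 codons; dropping the stop leaves 991 residues.
residues = 2976 // 3 - 1
mature = residues - 24  # residues remaining after the 24-residue signal peptide
```

Under these assumptions `tail` is `LVPAA*`, matching the last row of the listing (Leu Val Pro Ala Ala, then stop). `residues` equals the 991 given in `<211>` for SEQ ID NO: 27, and `mature` is 967, two residues past the final printed position label of 965.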

<210> 27 
<211> 991 
<212> PRT 
<213> Unknown 

<220> 

<221> SIGNAL
<222> (1)...(24)

<400> 27 

Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly
-20 -15 -10

Leu Ser Ala Ser Leu Ala Ser Gly Thr Asp Phe Pro Val Arg Arg Thr
-5 1 5

Asp Leu Gly Gln Val Gln Gly Leu Ala Gly Asp Val Met Ser Phe Arg
10 15 20

Gly Ile Pro Tyr Ala Ala Pro Pro Val Gly Gly Leu Arg Trp Lys Pro
25 30 35 40

Pro Gln His Ala Arg Pro Trp Ala Gly Val Arg Pro Ala Thr Gln Phe
45 50 55

Gly Ser Asp Cys Phe Gly Ala Ala Tyr Leu Arg Lys Gly Ser Leu Ala
60 65 70

Pro Gly Val Ser Glu Asp Cys Leu Tyr Leu Asn Val Trp Ala Pro Ser
75 80 85

Gly Ala Lys Pro Gly Gln Tyr Pro Val Met Val Trp Val Tyr Gly Gly
90 95 100

Gly Phe Ala Gly Gly Thr Ala Ala Met Pro Tyr Tyr Asp Gly Glu Ala
105 110 115 120

Leu Ala Arg Gln Gly Val Val Val Val Thr Phe Asn Tyr Arg Thr Asn
125 130 135

Ile Leu Gly Phe Phe Ala His Pro Gly Leu Ser Arg Glu Ser Pro Thr
140 145 150

Gly Thr Ser Gly Asn Tyr Gly Leu Leu Asp Ile Leu Ala Ala Leu Arg
155 160 165

Trp Val Gln Ser Asn Ala Arg Ala Phe Gly Gly Asp Pro Gly Arg Val
170 175 180

Thr Val Phe Gly Glu Ser Ala Gly Ala Ser Ala Ile Gly Leu Leu Leu
185 190 195 200

Thr Ser Pro Leu Ser Lys Gly Leu Phe Arg Gly Ala Ile Leu Glu Ser
205 210 215

Pro Gly Leu Thr Arg Pro Leu Ala Thr Leu Ala Asp Ser Ala Ala Ser
220 225 230

Gly Glu Arg Leu Asp Ala Asp Leu Ser Arg Leu Arg Ser Thr Asp Pro
235 240 245

Ala Thr Leu Met Ala Arg Ala Asp Ala Ala Arg Pro Ala Ser Arg Asp
250 255 260

Leu Arg Arg Pro Arg Pro Thr Gly Pro Ile Val Asp Gly His Val Leu
265 270 275 280

Pro Gln Thr Asp Ser Ala Ala Ile Ala Ala Gly Gln Leu Ala Pro Val
285 290 295

Arg Val Leu Ile Gly Thr Asn Ala Asp Glu Gly Arg Ala Phe Leu Gly
300 305 310

Arg Ala Pro Met Glu Thr Pro Ala Asp Tyr Gln Ala Tyr Leu Glu Ala
315 320 325

Gln Phe Gly Asp Gln Ala Ala Ala Val Ala Ala Cys Tyr Pro Leu Asp
330 335 340

Gly Arg Ala Thr Pro Lys Glu Met Val Ala Arg Ile Phe Gly Asp Asn
345 350 355 360

Gln Phe Asn Arg Gly Val Ser Ala Phe Ser Glu Ala Leu Val Arg Gln
365 370 375

Gly Ala Pro Val Trp Arg Tyr Gln Phe Asn Gly Asn Thr Glu Gly Gly
380 385 390

Arg Ala Pro Ala Thr His Gly Ala Glu Ile Pro Tyr Val Phe Gly Val
395 400 405

Phe Lys Leu Asp Glu Leu Gly Leu Phe Asp Trp Pro Pro Glu Gly Pro
410 415 420

Thr Pro Ala Asp Arg Ala Leu Gly Gln Leu Met Ser Ser Ala Trp Val
425 430 435 440

Arg Phe Ala Lys Asn Gly Asp Pro Ala Gly Asp Ala Leu Thr Trp Pro
445 450 455

Ala Tyr Ser Thr Gly Lys Ser Thr Met Thr Phe Gly Pro Glu Gly Arg
460 465 470

Ala Ala Val Val Ser Pro Gly Pro Ser Ile Pro Pro Cys Ala Asp Gly
475 480 485

Ala Lys Ala Gly Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser
490 495 500

Lys Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly
505 510 515 520

Leu Glu Thr Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val
525 530 535

Leu Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser
540 545 550

Gly Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn
555 560 565

Asp Ser Asn Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu
570 575 580

Glu Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln
585 590 595 600

Asp Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu
605 610 615

Glu Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu
620 625 630

Ile Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys
635 640 645

Arg Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn
650 655 660

Leu Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu
665 670 675 680

Gly Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile
685 690 695

Lys Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly
700 705 710

Gly Gln Tyr Met Arg Cys Lys Thr Gly Met Gln Ser Ile Cys His Ala
715 720 725

Met Ser Lys Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val
730 735 740

Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser
745 750 755 760

Gly Ala Val Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr
765 770 775

Leu Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln
780 785 790

Ala Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe
795 800 805

Val Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu
810 815 820

Gln Ser Ser Cys Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp
825 830 835 840

Val Asp Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly
845 850 855

Arg Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp
860 865 870

Asp Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu
875 880 885

Pro Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln
890 895 900

Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly
905 910 915 920

Ser Ala Leu Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu
925 930 935

Thr Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly
940 945 950

Gln Arg Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala
955 960 965

<210> 28
<211> 3618
<212> DNA
<213> Unknown

<220>
<223> gst:esp1:sp:K:trAPAO, 3618. 1-687, gst +
polylinker; 688-2190, esp1 mat; 2191-2226, spacer;
2227-3615, K:trAPAO; 3616-3618, stop codon. For
bacterial expression.

<221> CDS
<222> (1)...(3615)

<221> misc_feature
<222> (1)...(687)
<223> gst + polylinker

<221> mat_peptide
<222> (688)...(2190)
<223> esp1 mat

<221> misc_feature
<222> (2191)...(2226)
<223> spacer sequence

<221> misc_feature
<222> (2227)...(3615)
<223> K:trAPAO

<221> misc_feature
<222> (2227)...(2229)
<223> Extra lysine

<400> 28

atg tcc cct ata cta ggt tat tgg aaa att aag ggc ctt gtg caa ccc 48
Met Ser Pro Ile Leu Gly Tyr Trp Lys Ile Lys Gly Leu Val Gln Pro
1 5 10 15

act cga ctt ctt ttg gaa tat ctt gaa gaa aaa tat gaa gag cat ttg 96
Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His Leu
20 25 30

tat gag cgc gat gaa ggt gat aaa tgg cga aac aaa aag ttt gaa ttg 144
Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe Glu Leu
35 40 45

ggt ttg gag ttt ccc aat ctt cct tat tat att gat ggt gat gtt aaa 192
Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr Ile Asp Gly Asp Val Lys
50 55 60

tta aca cag tct atg gcc atc ata cgt tat ata gct gac aag cac aac 240
Leu Thr Gln Ser Met Ala Ile Ile Arg Tyr Ile Ala Asp Lys His Asn
65 70 75 80

atg ttg ggt ggt tgt cca aaa gag cgt gca gag att tca atg ctt gaa 288
Met Leu Gly Gly Cys Pro Lys Glu Arg Ala Glu Ile Ser Met Leu Glu
85 90 95

gga gcg gtt ttg gat att aga tac ggt gtt tcg aga att gca tat agt 336
Gly Ala Val Leu Asp Ile Arg Tyr Gly Val Ser Arg Ile Ala Tyr Ser
100 105 110

aaa gac ttt gaa act ctc aaa gtt gat ttt ctt agc aag cta cct gaa 384
Lys Asp Phe Glu Thr Leu Lys Val Asp Phe Leu Ser Lys Leu Pro Glu
115 120 125

atg ctg aaa atg ttc gaa gat cgt tta tgt cat aaa aca tat tta aat 432
Met Leu Lys Met Phe Glu Asp Arg Leu Cys His Lys Thr Tyr Leu Asn
130 135 140

ggt gat cat gta acc cat cct gac ttc atg ttg tat gac gct ctt gat 480
Gly Asp His Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp
145 150 155 160

gtt gtt tta tac atg gac cca atg tgc ctg gat gcg ttc cca aaa tta 528
Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu
165 170 175

gtt tgt ttt aaa aaa cgt att gaa gct atc cca caa att gat aag tac 576
Val Cys Phe Lys Lys Arg Ile Glu Ala Ile Pro Gln Ile Asp Lys Tyr
180 185 190

ttg aaa tcc agc aag tat ata gca tgg cct ttg cag ggc tgg caa gcc 624
Leu Lys Ser Ser Lys Tyr Ile Ala Trp Pro Leu Gln Gly Trp Gln Ala
195 200 205

acg ttt ggt ggt ggc gac cat cct cca aaa tcg gat ctg gtt ccg cgt 672
Thr Phe Gly Gly Gly Asp His Pro Pro Lys Ser Asp Leu Val Pro Arg
210 215 220

gga tcc ccg gaa ttc gct cct act gtc aag att gat gct ggg atg gtg 720
Gly Ser Pro Glu Phe Ala Pro Thr Val Lys Ile Asp Ala Gly Met Val
225 230 235 240

gtc ggc acg act act act gtc ccc ggc acc act gcg acc gtc agc gag 768
Val Gly Thr Thr Thr Thr Val Pro Gly Thr Thr Ala Thr Val Ser Glu
245 250 255

ttc ttg ggc gtt cct ttt gcc gcc tct ccg aca cga ttt gcg cct cct 816
Phe Leu Gly Val Pro Phe Ala Ala Ser Pro Thr Arg Phe Ala Pro Pro
260 265 270

act cgt ccc gtg cct tgg tca acg cct ttg caa gcc act gca tat ggt 864
Thr Arg Pro Val Pro Trp Ser Thr Pro Leu Gln Ala Thr Ala Tyr Gly
275 280 285

cca gca tgc cct caa caa ttc aat tac ccc gaa gaa ctc cgt gag att 912
Pro Ala Cys Pro Gln Gln Phe Asn Tyr Pro Glu Glu Leu Arg Glu Ile
290 295 300

acg atg gcc tgg ttc aat aca ccg ccc ccg tca gct ggt gaa agt gag 960
Thr Met Ala Trp Phe Asn Thr Pro Pro Pro Ser Ala Gly Glu Ser Glu
305 310 315 320

gac tgc ctg aac ctc aac atc tac gtc cca gga act gag aac aca aac 1008
Asp Cys Leu Asn Leu Asn Ile Tyr Val Pro Gly Thr Glu Asn Thr Asn
325 330 335

aaa gcc gtc atg gtt tgg ata tac ggt gga gcg ctg gaa tat ggt tgg 1056
Lys Ala Val Met Val Trp Ile Tyr Gly Gly Ala Leu Glu Tyr Gly Trp
340 345 350

aat tca ttc cac ctt tac gac ggg gct agt ttc gca gcc aat cag gat 1104
Asn Ser Phe His Leu Tyr Asp Gly Ala Ser Phe Ala Ala Asn Gln Asp
355 360 365

gtc atc gcc gtg acc atc aac tac aga acg aac att ctg ggg ttc cct 1152
Val Ile Ala Val Thr Ile Asn Tyr Arg Thr Asn Ile Leu Gly Phe Pro
370 375 380

gct gcc cct cag ctt cca ata aca cag cga aat ctg ggg ttc cta gac 1200
Ala Ala Pro Gln Leu Pro Ile Thr Gln Arg Asn Leu Gly Phe Leu Asp
385 390 395 400

caa agg ttt gct ttg gat tgg gta cag cgg aac atc gca gcc ttt ggc 1248
Gln Arg Phe Ala Leu Asp Trp Val Gln Arg Asn Ile Ala Ala Phe Gly
405 410 415

ggt gat cct cga aag gtc aca ata ttt ggg cag agt gcg ggg ggc aga 1296
Gly Asp Pro Arg Lys Val Thr Ile Phe Gly Gln Ser Ala Gly Gly Arg
420 425 430

agt gtc gac gtc ctc ttg acg tct atg cca cac aac cca ccc ttc cga 1344
Ser Val Asp Val Leu Leu Thr Ser Met Pro His Asn Pro Pro Phe Arg
435 440 445

gca gca atc atg gag tcc ggt gtg gct aac tac aac ttc ccc aag gga 1392
Ala Ala Ile Met Glu Ser Gly Val Ala Asn Tyr Asn Phe Pro Lys Gly
450 455 460

gat ttg tcc gaa cct tgg aac acc act gtt caa gct ctc aac tgt acc 1440
Asp Leu Ser Glu Pro Trp Asn Thr Thr Val Gln Ala Leu Asn Cys Thr
465 470 475 480

acc agt atc gac atc ttg agt tgt atg aga aga gtc gat ctc gcc act 1488
Thr Ser Ile Asp Ile Leu Ser Cys Met Arg Arg Val Asp Leu Ala Thr
485 490 495

ctg atg aac acg atc gag caa ctc gga ctt ggg ttt gag tac acg ttg 1536
Leu Met Asn Thr Ile Glu Gln Leu Gly Leu Gly Phe Glu Tyr Thr Leu
500 505 510

gac aac gta acg gct gtg tac cgt tct gaa acg gct cgc acg act ggt 1584
Asp Asn Val Thr Ala Val Tyr Arg Ser Glu Thr Ala Arg Thr Thr Gly
515 520 525

gac att gct cgt gta cct gtt ctc gtc ggg acg gtg gcc aac gac gga 1632
Asp Ile Ala Arg Val Pro Val Leu Val Gly Thr Val Ala Asn Asp Gly
530 535 540

ctt ctc ttt gtc ctc ggg gag aat gac acc caa gca tat ctc gag gag 1680
Leu Leu Phe Val Leu Gly Glu Asn Asp Thr Gln Ala Tyr Leu Glu Glu
545 550 555 560

gca atc ccg aat cag ccc gac ctt tac cag act ctc ctt gga gca tat 1728
Ala Ile Pro Asn Gln Pro Asp Leu Tyr Gln Thr Leu Leu Gly Ala Tyr
565 570 575

ccc att gga tcc cca ggg atc gga tcg cct caa gat cag att gcc gcc 1776
Pro Ile Gly Ser Pro Gly Ile Gly Ser Pro Gln Asp Gln Ile Ala Ala
580 585 590

att gag acc gag gta aga ttc cag tgt cct tct gcc atc gtg gct cag 1824
Ile Glu Thr Glu Val Arg Phe Gln Cys Pro Ser Ala Ile Val Ala Gln
595 600 605

gac tcc cgg aat cgg ggt atc cct tct tgg cgc tac tac tac aat gcg 1872
Asp Ser Arg Asn Arg Gly Ile Pro Ser Trp Arg Tyr Tyr Tyr Asn Ala
610 615 620

acc ttt gag aat ctg gag ctt ttc cct ggg tcc gaa gtg tac cac agc 1920
Thr Phe Glu Asn Leu Glu Leu Phe Pro Gly Ser Glu Val Tyr His Ser
625 630 635 640

tct gaa gtc ggg atg gtg ttt ggc acg tat cct gtc gca agt gcg acc 1968
Ser Glu Val Gly Met Val Phe Gly Thr Tyr Pro Val Ala Ser Ala Thr
645 650 655

gcc ttg gag gcc cag acg agc aaa tac atg cag ggt gcc tgg gcg gcc 2016
Ala Leu Glu Ala Gln Thr Ser Lys Tyr Met Gln Gly Ala Trp Ala Ala
660 665 670

ttt gcc aaa aac ccc atg aat ggg cct ggg tgg aaa caa gtg ccg aat 2064
Phe Ala Lys Asn Pro Met Asn Gly Pro Gly Trp Lys Gln Val Pro Asn
675 680 685

gtc gcg gcg ctt ggc tca cca ggc aaa gcc atc cag gtt gac gtc tct 2112
Val Ala Ala Leu Gly Ser Pro Gly Lys Ala Ile Gln Val Asp Val Ser
690 695 700

cca gcg aca ata gac caa cga tgt gcc ttg tac acg cgt tat tat act 2160
Pro Ala Thr Ile Asp Gln Arg Cys Ala Leu Tyr Thr Arg Tyr Tyr Thr
705 710 715 720

gag ttg ggc aca atc gcg ccg agg aca ttt ggc gga ggc agc ggc gga 2208
Glu Leu Gly Thr Ile Ala Pro Arg Thr Phe Gly Gly Gly Ser Gly Gly
725 730 735

ggc agc ggc gga ggc agc aaa gac aac gtt gcg gac gtg gta gtg gtg 2256
Gly Ser Gly Gly Gly Ser Lys Asp Asn Val Ala Asp Val Val Val Val
740 745 750

ggc gct ggc ttg agc ggt ttg gag acg gca cgc aaa gtc cag gcc gcc 2304
Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val Gln Ala Ala
755 760 765

ggt ctg tcc tgc ctc gtt ctt gag gcg atg gat cgt gta ggg gga aag 2352
Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val Gly Gly Lys
770 775 780

act ctg agc gta caa tcg ggt ccc ggc agg acg act atc aac gac ctc 2400
Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile Asn Asp Leu
785 790 795 800

ggc gct gcg tgg atc aat gac agc aac caa agc gaa gta tcc aga ttg 2448
Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val Ser Arg Leu
805 810 815

ttt gaa aga ttt cat ttg gag ggc gag ctc cag agg acg act gga aat 2496
Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr Thr Gly Asn
820 825 830

tca atc cat caa gca caa gac ggt aca acc act aca gct cct tat ggt 2544
Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala Pro Tyr Gly
835 840 845

gac tcc ttg ctg agc gag gag gtt gca agt gca ctt gcg gaa ctc ctc 2592
Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala Glu Leu Leu
850 855 860

ccc gta tgg tct cag ctg atc gaa gag cat agc ctt caa gac ctc aag 2640
Pro Val Trp Ser Gln Leu Ile Glu Glu His Ser Leu Gln Asp Leu Lys
865 870 875 880

gcg agc cct cag gcg aag cgg ctc gac agt gtg agc ttc gcg cac tac 2688
Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe Ala His Tyr
885 890 895

tgt gag aag gaa cta aac ttg cct gct gtt ctc ggc gta gca aac cag 2736
Cys Glu Lys Glu Leu Asn Leu Pro Ala Val Leu Gly Val Ala Asn Gln
900 905 910

atc aca cgc gct ctg ctc ggt gtg gaa gcc cac gag atc agc atg ctt 2784
Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile Ser Met Leu
915 920 925

ttt ctc acc gac tac atc aag agt gcc acc ggt ctc agt aat att ttc 2832
Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser Asn Ile Phe
930 935 940

tcg gac aag aaa gac ggc ggg cag tat atg cga tgc aaa aca ggt atg 2880
Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys Thr Gly Met
945 950 955 960

cag tcg att tgc cat gcc atg tca aag gaa ctt gtt cca ggc tca gtg 2928
Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro Gly Ser Val
965 970 975

cac ctc aac acc ccc gtc gct gaa att gag cag tcg gca tcc ggc tgt 2976
His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys
980 985 990

aca gta cga tcg gcc tcg ggc gcc gtg ttc cga agc aaa aag gtg gtg 3024
Thr Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys Lys Val Val
995 1000 1005

gtt tcg tta ccg aca acc ttg tat ccc acc ttg aca ttt tca cca cct 3072
Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe Ser Pro Pro
1010 1015 1020

ctt ccc gcc gag aag caa gca ttg gcg gaa aat tct atc ctg ggc tac 3120
Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn Ser Ile Leu Gly Tyr
1025 1030 1035 1040

tat agc aag ata gtc ttc gta tgg gac aag ccg tgg tgg cgc gaa caa 3168
Tyr Ser Lys Ile Val Phe Val Trp Asp Lys Pro Trp Trp Arg Glu Gln
1045 1050 1055

ggc ttc tcg ggc gtc ctc caa tcg agc tgt gac ccc atc tca ttt gcc 3216
Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile Ser Phe Ala
1060 1065 1070

aga gat acc agc atc gac gtc gat cga caa tgg tcc att acc tgt ttc 3264
Arg Asp Thr Ser Ile Asp Val Asp Arg Gln Trp Ser Ile Thr Cys Phe
1075 1080 1085

atg gtc gga gac ccg gga cgg aag tgg tcc caa cag tcc aag cag gta 3312
Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser Lys Gln Val
1090 1095 1100

cga caa aag tct gtc tgg gac caa ctc cgc gca gcc tac gag aac gcc 3360
Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr Glu Asn Ala
1105 1110 1115 1120

ggg gcc caa gtc cca gag ccg gcc aac gtg ctc gaa atc gag tgg tcg 3408
Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile Glu Trp Ser
1125 1130 1135

aag cag cag tat ttc caa gga gct ccg agc gcc gtc tat ggg ctg aac 3456
Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn
1140 1145 1150

gat ctc atc aca ctg ggt tcg gcg ctc aga acg ccg ttc aag agt gtt 3504
Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe Lys Ser Val
1155 1160 1165

cat ttc gtt gga acg gag acg tct tta gtt tgg aaa ggg tat atg gaa 3552
His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly Tyr Met Glu
1170 1175 1180

ggg gcc ata cga tcg ggt caa cga ggt gct gca gaa gtt gtg gct agc 3600
Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val Val Ala Ser
1185 1190 1195 1200

ctg gtg cca gca gca tag 3618
Leu Val Pro Ala Ala
1205

<210> 29
<211> 1205
<212> PRT
<213> Unknown

<400> 29

Met Ser Pro Ile Leu Gly Tyr Trp Lys Ile Lys Gly Leu Val Gln Pro
1 5 10 15

Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His Leu
20 25 30

Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe Glu Leu
35 40 45

Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr Ile Asp Gly Asp Val Lys
50 55 60

Leu Thr Gln Ser Met Ala Ile Ile Arg Tyr Ile Ala Asp Lys His Asn
65 70 75 80

Met Leu Gly Gly Cys Pro Lys Glu Arg Ala Glu Ile Ser Met Leu Glu
85 90 95

Gly Ala Val Leu Asp Ile Arg Tyr Gly Val Ser Arg Ile Ala Tyr Ser
100 105 110

Lys Asp Phe Glu Thr Leu Lys Val Asp Phe Leu Ser Lys Leu Pro Glu
115 120 125

Met Leu Lys Met Phe Glu Asp Arg Leu Cys His Lys Thr Tyr Leu Asn
130 135 140

Gly Asp His Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp
145 150 155 160

Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu
165 170 175

Val Cys Phe Lys Lys Arg Ile Glu Ala Ile Pro Gln Ile Asp Lys Tyr
180 185 190

Leu Lys Ser Ser Lys Tyr Ile Ala Trp Pro Leu Gln Gly Trp Gln Ala
195 200 205

Thr Phe Gly Gly Gly Asp His Pro Pro Lys Ser Asp Leu Val Pro Arg
210 215 220

Gly Ser Pro Glu Phe Ala Pro Thr Val Lys Ile Asp Ala Gly Met Val
225 230 235 240

Val Gly Thr Thr Thr Thr Val Pro Gly Thr Thr Ala Thr Val Ser Glu
245 250 255

Phe Leu Gly Val Pro Phe Ala Ala Ser Pro Thr Arg Phe Ala Pro Pro
260 265 270

Thr Arg Pro Val Pro Trp Ser Thr Pro Leu Gln Ala Thr Ala Tyr Gly
275 280 285

Pro Ala Cys Pro Gln Gln Phe Asn Tyr Pro Glu Glu Leu Arg Glu Ile
290 295 300

Thr Met Ala Trp Phe Asn Thr Pro Pro Pro Ser Ala Gly Glu Ser Glu
305 310 315 320

Asp Cys Leu Asn Leu Asn Ile Tyr Val Pro Gly Thr Glu Asn Thr Asn
325 330 335

Lys Ala Val Met Val Trp Ile Tyr Gly Gly Ala Leu Glu Tyr Gly Trp
340 345 350

Asn Ser Phe His Leu Tyr Asp Gly Ala Ser Phe Ala Ala Asn Gln Asp
355 360 365

Val Ile Ala Val Thr Ile Asn Tyr Arg Thr Asn Ile Leu Gly Phe Pro
370 375 380

Ala Ala Pro Gln Leu Pro Ile Thr Gln Arg Asn Leu Gly Phe Leu Asp
385 390 395 400

Gln Arg Phe Ala Leu Asp Trp Val Gln Arg Asn Ile Ala Ala Phe Gly
405 410 415

Gly Asp Pro Arg Lys Val Thr Ile Phe Gly Gln Ser Ala Gly Gly Arg
420 425 430

Ser Val Asp Val Leu Leu Thr Ser Met Pro His Asn Pro Pro Phe Arg
435 440 445

Ala Ala Ile Met Glu Ser Gly Val Ala Asn Tyr Asn Phe Pro Lys Gly
450 455 460

Asp Leu Ser Glu Pro Trp Asn Thr Thr Val Gln Ala Leu Asn Cys Thr
465 470 475 480

Thr Ser Ile Asp Ile Leu Ser Cys Met Arg Arg Val Asp Leu Ala Thr
485 490 495

Leu Met Asn Thr Ile Glu Gln Leu Gly Leu Gly Phe Glu Tyr Thr Leu
500 505 510

Asp Asn Val Thr Ala Val Tyr Arg Ser Glu Thr Ala Arg Thr Thr Gly
515 520 525

Asp Ile Ala Arg Val Pro Val Leu Val Gly Thr Val Ala Asn Asp Gly
530 535 540

Leu Leu Phe Val Leu Gly Glu Asn Asp Thr Gln Ala Tyr Leu Glu Glu
545 550 555 560

Ala Ile Pro Asn Gln Pro Asp Leu Tyr Gln Thr Leu Leu Gly Ala Tyr
565 570 575

Pro Ile Gly Ser Pro Gly Ile Gly Ser Pro Gln Asp Gln Ile Ala Ala
580 585 590

Ile Glu Thr Glu Val Arg Phe Gln Cys Pro Ser Ala Ile Val Ala Gln
595 600 605

Asp Ser Arg Asn Arg Gly Ile Pro Ser Trp Arg Tyr Tyr Tyr Asn Ala
610 615 620

Thr Phe Glu Asn Leu Glu Leu Phe Pro Gly Ser Glu Val Tyr His Ser
625 630 635 640

Ser Glu Val Gly Met Val Phe Gly Thr Tyr Pro Val Ala Ser Ala Thr
645 650 655

Ala Leu Glu Ala Gln Thr Ser Lys Tyr Met Gln Gly Ala Trp Ala Ala
660 665 670

Phe Ala Lys Asn Pro Met Asn Gly Pro Gly Trp Lys Gln Val Pro Asn
675 680 685

Val Ala Ala Leu Gly Ser Pro Gly Lys Ala Ile Gln Val Asp Val Ser
690 695 700

Pro Ala Thr Ile Asp Gln Arg Cys Ala Leu Tyr Thr Arg Tyr Tyr Thr
705 710 715 720

Glu Leu Gly Thr Ile Ala Pro Arg Thr Phe Gly Gly Gly Ser Gly Gly
725 730 735

Gly Ser Gly Gly Gly Ser Lys Asp Asn Val Ala Asp Val Val Val Val
740 745 750

Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val Gln Ala Ala
755 760 765

Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val Gly Gly Lys
770 775 780

Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile Asn Asp Leu
785 790 795 800

Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val Ser Arg Leu
805 810 815

Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr Thr Gly Asn
820 825 830

Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala Pro Tyr Gly
835 840 845

Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala Glu Leu Leu
850 855 860

Pro Val Trp Ser Gln Leu Ile Glu Glu His Ser Leu Gln Asp Leu Lys
865 870 875 880

Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe Ala His Tyr
885 890 895

Cys Glu Lys Glu Leu Asn Leu Pro Ala Val Leu Gly Val Ala Asn Gln
900 905 910

Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile Ser Met Leu
915 920 925

Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser Asn Ile Phe
930 935 940

Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys Thr Gly Met
945 950 955 960

Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro Gly Ser Val
965 970 975

His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys
980 985 990

Thr Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys Lys Val Val
995 1000 1005

Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe Ser Pro Pro
1010 1015 1020

Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn Ser Ile Leu Gly Tyr
1025 1030 1035 1040

Tyr Ser Lys Ile Val Phe Val Trp Asp Lys Pro Trp Trp Arg Glu Gln
1045 1050 1055

Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile Ser Phe Ala
1060 1065 1070

Arg Asp Thr Ser Ile Asp Val Asp Arg Gln Trp Ser Ile Thr Cys Phe
1075 1080 1085

Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser Lys Gln Val
1090 1095 1100

Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr Glu Asn Ala
1105 1110 1115 1120

Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile Glu Trp Ser
1125 1130 1135

Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn
1140 1145 1150

Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe Lys Ser Val
1155 1160 1165

His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly Tyr Met Glu
1170 1175 1180

Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val Val Ala Ser
1185 1190 1195 1200

Leu Val Pro Ala Ala 
1205 

<210> 30 
<211> 3591 
<212> DNA 
<213> Unknown 

<220> 

<223> Open reading frame of BEST1:K:trAPAO fusion for
bacterial expression vector pGEX-4T-1 or similar
vector, gst:BEST1:sp:K:trAPAO fusion, 3591 nt.
1-687, gst + polylinker; 688-2163, BEST1 mature;
2164-2199, spacer; 2200-3588, K:trAPAO






<221> misc_feature
<222> (1) . . . (687)
<223> gst + polylinker

<221> mat_peptide
<222> (688) . . . (2163)
<223> BEST1 mature

<221> misc_feature
<222> (2164) . . . (2199)
<223> spacer sequence

<221> misc_feature
<222> (2200) . . . (3588)
<223> K:trAPAO



<221> CDS 

<222> (1) . . . (3588) 

<221> misc_feature 
<222> (2200) . . . (2202) 
<223> Extra lysine 



<400> 30

atg tcc cct ata cta ggt tat tgg aaa att aag ggc ctt gtg caa ccc 48
Met Ser Pro Ile Leu Gly Tyr Trp Lys Ile Lys Gly Leu Val Gln Pro
1 5 10 15

act cga ctt ctt ttg gaa tat ctt gaa gaa aaa tat gaa gag cat ttg 96
Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His Leu
20 25 30

tat gag cgc gat gaa ggt gat aaa tgg cga aac aaa aag ttt gaa ttg 144
Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe Glu Leu
35 40 45

ggt ttg gag ttt ccc aat ctt cct tat tat att gat ggt gat gtt aaa 192
Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr Ile Asp Gly Asp Val Lys
50 55 60

tta aca cag tct atg gcc atc ata cgt tat ata gct gac aag cac aac 240
Leu Thr Gln Ser Met Ala Ile Ile Arg Tyr Ile Ala Asp Lys His Asn
65 70 75 80

atg ttg ggt ggt tgt cca aaa gag cgt gca gag att tca atg ctt gaa 288
Met Leu Gly Gly Cys Pro Lys Glu Arg Ala Glu Ile Ser Met Leu Glu
85 90 95

gga gcg gtt ttg gat att aga tac ggt gtt tcg aga att gca tat agt 336
Gly Ala Val Leu Asp Ile Arg Tyr Gly Val Ser Arg Ile Ala Tyr Ser
100 105 110

aaa gac ttt gaa act ctc aaa gtt gat ttt ctt agc aag cta cct gaa 384
Lys Asp Phe Glu Thr Leu Lys Val Asp Phe Leu Ser Lys Leu Pro Glu
115 120 125

atg ctg aaa atg ttc gaa gat cgt tta tgt cat aaa aca tat tta aat 432
Met Leu Lys Met Phe Glu Asp Arg Leu Cys His Lys Thr Tyr Leu Asn
130 135 140

ggt gat cat gta acc cat cct gac ttc atg ttg tat gac gct ctt gat 480
Gly Asp His Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp
145 150 155 160



gtt gtt tta tac atg gac cca atg tgc ctg gat gcg ttc cca aaa tta 528 
Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu 
165 170 175 

gtt tgt ttt aaa aaa cgt att gaa gct atc cca caa att gat aag tac 576
Val Cys Phe Lys Lys Arg Ile Glu Ala Ile Pro Gln Ile Asp Lys Tyr
180 185 190

ttg aaa tcc agc aag tat ata gca tgg cct ttg cag ggc tgg caa gcc 624
Leu Lys Ser Ser Lys Tyr Ile Ala Trp Pro Leu Gln Gly Trp Gln Ala
195 200 205

acg ttt ggt ggt ggc gac cat cct cca aaa tcg gat ctg gtt ccg cgt 672
Thr Phe Gly Gly Gly Asp His Pro Pro Lys Ser Asp Leu Val Pro Arg
210 215 220

gga tcc ccg gaa ttc acg gat ttt ccg gtc cgc agg acc gat ctg ggc 720
Gly Ser Pro Glu Phe Thr Asp Phe Pro Val Arg Arg Thr Asp Leu Gly
225 230 235 240

cag gtt cag gga ctg gcc ggg gac gtg atg agc ttt cgc gga ata ccc 768
Gln Val Gln Gly Leu Ala Gly Asp Val Met Ser Phe Arg Gly Ile Pro
245 250 255

tat gca gcg ccg ccg gtg ggc ggg ctg cgt tgg aag ccg ccc caa cac 816
Tyr Ala Ala Pro Pro Val Gly Gly Leu Arg Trp Lys Pro Pro Gln His
260 265 270

gcc cgg ccc tgg gcg ggc gtt cgc ccc gcc acc caa ttt ggc tcc gac 864
Ala Arg Pro Trp Ala Gly Val Arg Pro Ala Thr Gln Phe Gly Ser Asp
275 280 285

tgc ttc ggc gcg gcc tat ctt cgc aaa ggc agc ctc gcc ccc ggc gtg 912
Cys Phe Gly Ala Ala Tyr Leu Arg Lys Gly Ser Leu Ala Pro Gly Val
290 295 300

agc gag gac tgt ctt tac ctc aac gta tgg gcg ccg tca ggc gct aaa 960
Ser Glu Asp Cys Leu Tyr Leu Asn Val Trp Ala Pro Ser Gly Ala Lys
305 310 315 320

ccc ggc cag tac ccc gtc atg gtc tgg gtc tac ggc ggc ggc ttc gcc 1008
Pro Gly Gln Tyr Pro Val Met Val Trp Val Tyr Gly Gly Gly Phe Ala
325 330 335

ggc ggc acg gcc gcc atg ccc tac tac gac ggc gag gcg ctt gcg cga 1056
Gly Gly Thr Ala Ala Met Pro Tyr Tyr Asp Gly Glu Ala Leu Ala Arg
340 345 350

cag ggc gtc gtc gtg gtg acg ttt aac tat cgg acg aac atc ctg ggc 1104
Gln Gly Val Val Val Val Thr Phe Asn Tyr Arg Thr Asn Ile Leu Gly
355 360 365

ttt ttc gcc cat cct ggt ctc tcg cgc gag agc ccc acc gga act tcg 1152
Phe Phe Ala His Pro Gly Leu Ser Arg Glu Ser Pro Thr Gly Thr Ser
370 375 380

ggc aac tac ggc cta ctc gac att ctc gcc gct ctt cgg tgg gtg cag 1200
Gly Asn Tyr Gly Leu Leu Asp Ile Leu Ala Ala Leu Arg Trp Val Gln
385 390 395 400

agc aac gcc cgc gcc ttc gga ggg gac ccc ggc cga gtg acg gtc ttt 1248
Ser Asn Ala Arg Ala Phe Gly Gly Asp Pro Gly Arg Val Thr Val Phe
405 410 415

ggt gaa tcg gcc gga gcg agc gcg atc gga ctt ctg ctc acc tcg ccg 1296
Gly Glu Ser Ala Gly Ala Ser Ala Ile Gly Leu Leu Leu Thr Ser Pro
420 425 430

ctg agc aag ggt ctc ttc cgt ggc gct atc ctc gaa agt cca ggg ctg 1344
Leu Ser Lys Gly Leu Phe Arg Gly Ala Ile Leu Glu Ser Pro Gly Leu
435 440 445

acg cga ccg ctc gcg acg ctc gcc gac agc gcc gcc tcg ggc gag cgc 1392
Thr Arg Pro Leu Ala Thr Leu Ala Asp Ser Ala Ala Ser Gly Glu Arg
450 455 460

ctc gac gcc gat ctt tcg cga ctg cgc tcg acc gac cca gcc acc ctg 1440
Leu Asp Ala Asp Leu Ser Arg Leu Arg Ser Thr Asp Pro Ala Thr Leu
465 470 475 480

atg gcg cgc gcc gac gcg gcc cgc ccg gca tcg cgg gac ctg cgc agg 1488
Met Ala Arg Ala Asp Ala Ala Arg Pro Ala Ser Arg Asp Leu Arg Arg
485 490 495

ccg cgt ccg acc gga ccg atc gtc gat ggc cat gtg ctg ccg cag acc 1536
Pro Arg Pro Thr Gly Pro Ile Val Asp Gly His Val Leu Pro Gln Thr
500 505 510

gac agc gcg gcg atc gcg gcg ggg cag ctg gcg ccg gtt cgg gtc ctg 1584
Asp Ser Ala Ala Ile Ala Ala Gly Gln Leu Ala Pro Val Arg Val Leu
515 520 525

atc gga acc aat gcc gac gaa ggc cgc gcc ttc ctc ggg cgc gcg ccg 1632
Ile Gly Thr Asn Ala Asp Glu Gly Arg Ala Phe Leu Gly Arg Ala Pro
530 535 540

atg gag acg cca gcg gac tac caa gcc tat ctg gag gcg cag ttt ggc 1680
Met Glu Thr Pro Ala Asp Tyr Gln Ala Tyr Leu Glu Ala Gln Phe Gly
545 550 555 560

gac caa gcc gcc gcc gtg gcg gcg tgc tat ccc ctc gac ggc cgg gcc 1728
Asp Gln Ala Ala Ala Val Ala Ala Cys Tyr Pro Leu Asp Gly Arg Ala
565 570 575

acg ccc aag gaa atg gtc gcg cgc atc ttc ggc gac aat cag ttc aat 1776
Thr Pro Lys Glu Met Val Ala Arg Ile Phe Gly Asp Asn Gln Phe Asn
580 585 590

cgg ggg gtc tcg gcc ttc tcg gaa gcg ctt gtg cgc cag ggc gcg ccc 1824
Arg Gly Val Ser Ala Phe Ser Glu Ala Leu Val Arg Gln Gly Ala Pro
595 600 605

gtg tgg cgt tat cag ttc aac ggt aat acc gag ggt gga aga gcg ccg 1872
Val Trp Arg Tyr Gln Phe Asn Gly Asn Thr Glu Gly Gly Arg Ala Pro
610 615 620

gct acc cac gga gcc gaa att ccc tac gtt ttc ggg gtg ttc aag ctc 1920
Ala Thr His Gly Ala Glu Ile Pro Tyr Val Phe Gly Val Phe Lys Leu
625 630 635 640

gac gag ttg ggt ctg ttc gat tgg ccg ccc gag ggg ccc acg ccc gcc 1968
Asp Glu Leu Gly Leu Phe Asp Trp Pro Pro Glu Gly Pro Thr Pro Ala
645 650 655

gac cgt gcg ctg ggc caa ctg atg tcc tcc gcc tgg gtc cgg ttc gcc 2016
Asp Arg Ala Leu Gly Gln Leu Met Ser Ser Ala Trp Val Arg Phe Ala
660 665 670

aag aat ggc gac ccc gcc ggg gac gcc ctt acc tgg cct gcc tat tct 2064
Lys Asn Gly Asp Pro Ala Gly Asp Ala Leu Thr Trp Pro Ala Tyr Ser
675 680 685

acg ggc aag tcg acc atg aca ttc ggt ccc gag ggc cgc gcg gcg gtg 2112
Thr Gly Lys Ser Thr Met Thr Phe Gly Pro Glu Gly Arg Ala Ala Val
690 695 700

gtg tcg ccc gga cct tcc atc ccc cct tgc gcg gat ggc gcc aag gcg 2160
Val Ser Pro Gly Pro Ser Ile Pro Pro Cys Ala Asp Gly Ala Lys Ala
705 710 715 720

ggg ggc gga ggc agc ggc gga ggc agc ggc gga ggc agc aaa gac aac 2208
Gly Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Lys Asp Asn
725 730 735

gtt gcg gac gtg gta gtg gtg ggc gct ggc ttg agc ggt ttg gag acg 2256
Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr
740 745 750

gca cgc aaa gtc cag gcc gcc ggt ctg tcc tgc ctc gtt ctt gag gcg 2304
Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala
755 760 765

atg gat cgt gta ggg gga aag act ctg agc gta caa tcg ggt ccc ggc 2352
Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly
770 775 780

agg acg act atc aac gac ctc ggc gct gcg tgg atc aat gac agc aac 2400
Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn
785 790 795 800

caa agc gaa gta tcc aga ttg ttt gaa aga ttt cat ttg gag ggc gag 2448
Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu
805 810 815

ctc cag agg acg act gga aat tca atc cat caa gca caa gac ggt aca 2496
Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr
820 825 830

acc act aca gct cct tat ggt gac tcc ttg ctg agc gag gag gtt gca 2544
Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala
835 840 845

agt gca ctt gcg gaa ctc ctc ccc gta tgg tct cag ctg atc gaa gag 2592
Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu
850 855 860

cat agc ctt caa gac ctc aag gcg agc cct cag gcg aag cgg ctc gac 2640
His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp
865 870 875 880

agt gtg agc ttc gcg cac tac tgt gag aag gaa cta aac ttg cct gct 2688
Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn Leu Pro Ala
885 890 895

gtt ctc ggc gta gca aac cag atc aca cgc gct ctg ctc ggt gtg gaa 2736
Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu
900 905 910

gcc cac gag atc agc atg ctt ttt ctc acc gac tac atc aag agt gcc 2784
Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala
915 920 925

acc ggt ctc agt aat att ttc tcg gac aag aaa gac ggc ggg cag tat 2832
Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly Gln Tyr
930 935 940

atg cga tgc aaa aca ggt atg cag tcg att tgc cat gcc atg tca aag 2880
Met Arg Cys Lys Thr Gly Met Gln Ser Ile Cys His Ala Met Ser Lys
945 950 955 960

gaa ctt gtt cca ggc tca gtg cac ctc aac acc ccc gtc gct gaa att 2928
Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val Ala Glu Ile
965 970 975

gag cag tcg gca tcc ggc tgt aca gta cga tcg gcc tcg ggc gcc gtg 2976
Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser Gly Ala Val
980 985 990

ttc cga agc aaa aag gtg gtg gtt tcg tta ccg aca acc ttg tat ccc 3024
Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro
995 1000 1005

acc ttg aca ttt tca cca cct ctt ccc gcc gag aag caa gca ttg gcg 3072
Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala
1010 1015 1020

gaa aat tct atc ctg ggc tac tat agc aag ata gtc ttc gta tgg gac 3120
Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp
1025 1030 1035 1040

aag ccg tgg tgg cgc gaa caa ggc ttc tcg ggc gtc ctc caa tcg agc 3168
Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu Gln Ser Ser
1045 1050 1055

tgt gac ccc atc tca ttt gcc aga gat acc agc atc gac gtc gat cga 3216
Cys Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp Val Asp Arg
1060 1065 1070

caa tgg tcc att acc tgt ttc atg gtc gga gac ccg gga cgg aag tgg 3264
Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp
1075 1080 1085

tcc caa cag tcc aag cag gta cga caa aag tct gtc tgg gac caa ctc 3312
Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp Asp Gln Leu
1090 1095 1100

cgc gca gcc tac gag aac gcc ggg gcc caa gtc cca gag ccg gcc aac 3360
Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu Pro Ala Asn
1105 1110 1115 1120

gtg ctc gaa atc gag tgg tcg aag cag cag tat ttc caa gga gct ccg 3408
Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro
1125 1130 1135

agc gcc gtc tat ggg ctg aac gat ctc atc aca ctg ggt tcg gcg ctc 3456
Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu
1140 1145 1150

aga acg ccg ttc aag agt gtt cat ttc gtt gga acg gag acg tct tta 3504
Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu Thr Ser Leu
1155 1160 1165

gtt tgg aaa ggg tat atg gaa ggg gcc ata cga tcg ggt caa cga ggt 3552
Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly
1170 1175 1180

gct gca gaa gtt gtg gct agc ctg gtg cca gca gca tag 3591
Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala
1185 1190 1195



<210> 31 
<211> 1196 
<212> PRT 
<213> Unknown 



<400> 31 



Met Ser Pro Ile Leu Gly Tyr Trp Lys Ile Lys Gly Leu Val Gln Pro
1 5 10 15

Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His Leu
20 25 30

Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe Glu Leu
35 40 45

Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr Ile Asp Gly Asp Val Lys
50 55 60

Leu Thr Gln Ser Met Ala Ile Ile Arg Tyr Ile Ala Asp Lys His Asn
65 70 75 80

Met Leu Gly Gly Cys Pro Lys Glu Arg Ala Glu Ile Ser Met Leu Glu
85 90 95

Gly Ala Val Leu Asp Ile Arg Tyr Gly Val Ser Arg Ile Ala Tyr Ser
100 105 110

Lys Asp Phe Glu Thr Leu Lys Val Asp Phe Leu Ser Lys Leu Pro Glu
115 120 125

Met Leu Lys Met Phe Glu Asp Arg Leu Cys His Lys Thr Tyr Leu Asn
130 135 140

Gly Asp His Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp
145 150 155 160

Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu
165 170 175

Val Cys Phe Lys Lys Arg Ile Glu Ala Ile Pro Gln Ile Asp Lys Tyr
180 185 190

Leu Lys Ser Ser Lys Tyr Ile Ala Trp Pro Leu Gln Gly Trp Gln Ala
195 200 205

Thr Phe Gly Gly Gly Asp His Pro Pro Lys Ser Asp Leu Val Pro Arg
210 215 220

Gly Ser Pro Glu Phe Thr Asp Phe Pro Val Arg Arg Thr Asp Leu Gly
225 230 235 240

Gln Val Gln Gly Leu Ala Gly Asp Val Met Ser Phe Arg Gly Ile Pro
245 250 255

Tyr Ala Ala Pro Pro Val Gly Gly Leu Arg Trp Lys Pro Pro Gln His
260 265 270

Ala Arg Pro Trp Ala Gly Val Arg Pro Ala Thr Gln Phe Gly Ser Asp
275 280 285

Cys Phe Gly Ala Ala Tyr Leu Arg Lys Gly Ser Leu Ala Pro Gly Val
290 295 300

Ser Glu Asp Cys Leu Tyr Leu Asn Val Trp Ala Pro Ser Gly Ala Lys
305 310 315 320

Pro Gly Gln Tyr Pro Val Met Val Trp Val Tyr Gly Gly Gly Phe Ala
325 330 335

Gly Gly Thr Ala Ala Met Pro Tyr Tyr Asp Gly Glu Ala Leu Ala Arg
340 345 350

Gln Gly Val Val Val Val Thr Phe Asn Tyr Arg Thr Asn Ile Leu Gly
355 360 365

Phe Phe Ala His Pro Gly Leu Ser Arg Glu Ser Pro Thr Gly Thr Ser
370 375 380

Gly Asn Tyr Gly Leu Leu Asp Ile Leu Ala Ala Leu Arg Trp Val Gln
385 390 395 400

Ser Asn Ala Arg Ala Phe Gly Gly Asp Pro Gly Arg Val Thr Val Phe
405 410 415

Gly Glu Ser Ala Gly Ala Ser Ala Ile Gly Leu Leu Leu Thr Ser Pro
420 425 430

Leu Ser Lys Gly Leu Phe Arg Gly Ala Ile Leu Glu Ser Pro Gly Leu
435 440 445

Thr Arg Pro Leu Ala Thr Leu Ala Asp Ser Ala Ala Ser Gly Glu Arg
450 455 460

Leu Asp Ala Asp Leu Ser Arg Leu Arg Ser Thr Asp Pro Ala Thr Leu
465 470 475 480

Met Ala Arg Ala Asp Ala Ala Arg Pro Ala Ser Arg Asp Leu Arg Arg
485 490 495

Pro Arg Pro Thr Gly Pro Ile Val Asp Gly His Val Leu Pro Gln Thr
500 505 510

Asp Ser Ala Ala Ile Ala Ala Gly Gln Leu Ala Pro Val Arg Val Leu
515 520 525

Ile Gly Thr Asn Ala Asp Glu Gly Arg Ala Phe Leu Gly Arg Ala Pro
530 535 540

Met Glu Thr Pro Ala Asp Tyr Gln Ala Tyr Leu Glu Ala Gln Phe Gly
545 550 555 560

Asp Gln Ala Ala Ala Val Ala Ala Cys Tyr Pro Leu Asp Gly Arg Ala
565 570 575

Thr Pro Lys Glu Met Val Ala Arg Ile Phe Gly Asp Asn Gln Phe Asn
580 585 590

Arg Gly Val Ser Ala Phe Ser Glu Ala Leu Val Arg Gln Gly Ala Pro
595 600 605

Val Trp Arg Tyr Gln Phe Asn Gly Asn Thr Glu Gly Gly Arg Ala Pro
610 615 620

Ala Thr His Gly Ala Glu Ile Pro Tyr Val Phe Gly Val Phe Lys Leu
625 630 635 640

Asp Glu Leu Gly Leu Phe Asp Trp Pro Pro Glu Gly Pro Thr Pro Ala
645 650 655

Asp Arg Ala Leu Gly Gln Leu Met Ser Ser Ala Trp Val Arg Phe Ala
660 665 670

Lys Asn Gly Asp Pro Ala Gly Asp Ala Leu Thr Trp Pro Ala Tyr Ser
675 680 685

Thr Gly Lys Ser Thr Met Thr Phe Gly Pro Glu Gly Arg Ala Ala Val
690 695 700

Val Ser Pro Gly Pro Ser Ile Pro Pro Cys Ala Asp Gly Ala Lys Ala
705 710 715 720

Gly Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Lys Asp Asn
725 730 735

Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr
740 745 750

Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala
755 760 765

Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly
770 775 780

Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn
785 790 795 800

Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu
805 810 815

Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr
820 825 830

Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala
835 840 845

Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu
850 855 860

His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp
865 870 875 880

Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn Leu Pro Ala
885 890 895

Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu
900 905 910

Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala
915 920 925

Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly Gln Tyr
930 935 940

Met Arg Cys Lys Thr Gly Met Gln Ser Ile Cys His Ala Met Ser Lys
945 950 955 960

Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val Ala Glu Ile
965 970 975

Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser Gly Ala Val
980 985 990

Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro
995 1000 1005

Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala
1010 1015 1020

Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp
1025 1030 1035 1040

Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu Gln Ser Ser
1045 1050 1055

Cys Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp Val Asp Arg
1060 1065 1070

Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp
1075 1080 1085

Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp Asp Gln Leu
1090 1095 1100

Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu Pro Ala Asn
1105 1110 1115 1120

Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro
1125 1130 1135

Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu
1140 1145 1150

Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu Thr Ser Leu
1155 1160 1165

Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly
1170 1175 1180

Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala
1185 1190 1195



<210> 32
<211> 1803 
<212> DNA 
<213> Unknown 

<220> 

<221> CDS 

<222> (1) . . . (1803) 

<223> Glyc(-)APAO coding sequence; mutation in putative 
glycosylation sites 

<400> 32 






atg gca ctt gca ccg agc tac atc aat ccc cca aac gtc gcc tcc cca 48
Met Ala Leu Ala Pro Ser Tyr Ile Asn Pro Pro Asn Val Ala Ser Pro
1 5 10 15

gca ggg tat tct cac gtc ggc gta ggc cca gac gga ggg agg tat gtg 96
Ala Gly Tyr Ser His Val Gly Val Gly Pro Asp Gly Gly Arg Tyr Val
20 25 30

aca ata gct gga cag att gga caa gac gct tcg ggc gtg aca gac cct 144
Thr Ile Ala Gly Gln Ile Gly Gln Asp Ala Ser Gly Val Thr Asp Pro
35 40 45

gcc tac gag aaa cag gtt gcc caa gca ttc gcc aat ctg cga gct tgc 192
Ala Tyr Glu Lys Gln Val Ala Gln Ala Phe Ala Asn Leu Arg Ala Cys
50 55 60

ctt gct gca gtt gga gcc act tca aac gac gtc acc aag ctc aat tac 240
Leu Ala Ala Val Gly Ala Thr Ser Asn Asp Val Thr Lys Leu Asn Tyr
65 70 75 80

tac atc gtc gac tac gcc ccg agc aaa ctc acc gca att gga gat ggg 288
Tyr Ile Val Asp Tyr Ala Pro Ser Lys Leu Thr Ala Ile Gly Asp Gly
85 90 95

ctg aag gct acc ttt gcc ctt gac agg ctc cct cct tgc acg ctg gtg 336
Leu Lys Ala Thr Phe Ala Leu Asp Arg Leu Pro Pro Cys Thr Leu Val
100 105 110

cca gtg tcg gcc ttg tct tca cct gaa tac ctc ttt gag gtt gat gcc 384
Pro Val Ser Ala Leu Ser Ser Pro Glu Tyr Leu Phe Glu Val Asp Ala
115 120 125

acg gcg ctg gtg ccg gga cac acg acc cca gac aac gtt gcg gac gtg 432
Thr Ala Leu Val Pro Gly His Thr Thr Pro Asp Asn Val Ala Asp Val
130 135 140

gta gtg gtg ggc gct ggc ttg agc ggt ttg gag acg gca cgc aaa gtc 480
Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val
145 150 155 160

cag gcc gcc ggt ctg tcc tgc ctc gtt ctt gag gcg atg gat cgt gta 528
Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val
165 170 175

ggg gga aag act ctg agc gta caa tcg ggt ccc ggc agg acg act atc 576
Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile
180 185 190

aac gac ctc ggc gct gcg tgg atc aat gat agc aat cag gcc gaa gta 624
Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ala Glu Val
195 200 205

tcc aga ttg ttt gaa aga ttt cat ttg gag ggc gag ctc cag agg acg 672
Ser Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr
210 215 220

act gga aat tca atc cat caa gca caa gac ggt aca acc act aca gct 720
Thr Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala
225 230 235 240

cct tat ggt gac tcc ttg ctg agc gag gag gtt gca agt gca ctt gcg 768
Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala
245 250 255

gaa ctc ctc ccc gta tgg tct cag ctg atc gaa gag cat agc ctt caa 816
Glu Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu His Ser Leu Gln
260 265 270

gac ctc aag gcg agc cct cag gcg aag cgg ctc gac agt gtg agc ttc 864
Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe
275 280 285

gcg cac tac tgt gag aag gaa cta aac ttg cct gct gtt ctc ggc gta 912
Ala His Tyr Cys Glu Lys Glu Leu Asn Leu Pro Ala Val Leu Gly Val
290 295 300

gca aac cag atc aca cgc gct ctg ctc ggt gtg gaa gcc cac gag atc 960
Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile
305 310 315 320

agc atg ctt ttt ctc acc gac tac atc aag agt gcc acc ggt ctc agt 1008
Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser
325 330 335

aat att ttc tcg gac aag aaa gac ggc ggg cag tat atg cga tgc aaa 1056
Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys
340 345 350

aca ggt atg cag tcg att tgc cat gcc atg tca aag gaa ctt gtt cca 1104
Thr Gly Met Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro
355 360 365

ggc tca gtg cac ctc aac acc ccc gtc gct gaa att gag cag tcg gca 1152
Gly Ser Val His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala
370 375 380

tcc ggc tgt aca gta cga tcg gcc tcg ggc gcc gtg ttc cga agc aaa 1200
Ser Gly Cys Thr Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys
385 390 395 400

aag gtg gtg gtt tcg tta ccg aca acc ttg tat ccc acc ttg aca ttt 1248
Lys Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe
405 410 415

tca cca cct ctt ccc gcc gag aag caa gca ttg gcg gaa aat tct atc 1296
Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn Ser Ile
420 425 430

ctg ggc tac tat agc aag ata gtc ttc gta tgg gac aag ccg tgg tgg 1344
Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp Lys Pro Trp Trp
435 440 445

cgc gaa caa ggc ttc tcg ggc gtc ctc caa tcg agc tgt gac ccc atc 1392
Arg Glu Gln Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile
450 455 460

tca ttt gcc aga gat acc agc atc gac gtc gat cga caa tgg tcc att 1440
Ser Phe Ala Arg Asp Thr Ser Ile Asp Val Asp Arg Gln Trp Ser Ile
465 470 475 480

acc tgt ttc atg gtc gga gac ccg gga cgg aag tgg tcc caa cag tcc 1488
Thr Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser
485 490 495

aag cag gta cga caa aag tct gtc tgg gac caa ctc cgc gca gcc tac 1536
Lys Gln Val Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr
500 505 510

gag aac gcc ggg gcc caa gtc cca gag ccg gcc aac gtg ctc gaa atc 1584
Glu Asn Ala Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile
515 520 525

gag tgg tcg aag cag cag tat ttc caa gga gct ccg agc gcc gtc tat 1632
Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr
530 535 540

ggg ctg aac gat ctc atc aca ctg ggt tcg gcg ctc aga acg ccg ttc 1680
Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe
545 550 555 560

aag agt gtt cat ttc gtt gga acg gag acg tct tta gtt tgg aaa ggg 1728
Lys Ser Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly
565 570 575

tat atg gaa ggg gcc ata cga tcg ggt caa cga ggt gct gca gaa gtt 1776
Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val
580 585 590

gtg gct agc ctg gtg cca gca gca tag 1803
Val Ala Ser Leu Val Pro Ala Ala *
595 600



<210> 33 
<211> 600 
<212> PRT 
<213> Unknown 



<400> 33 
Met Ala Leu Ala Pro Ser Tyr Ile Asn Pro Pro Asn Val Ala Ser Pro
1 5 10 15

Ala Gly Tyr Ser His Val Gly Val Gly Pro Asp Gly Gly Arg Tyr Val
20 25 30

Thr Ile Ala Gly Gln Ile Gly Gln Asp Ala Ser Gly Val Thr Asp Pro
35 40 45

Ala Tyr Glu Lys Gln Val Ala Gln Ala Phe Ala Asn Leu Arg Ala Cys
50 55 60

Leu Ala Ala Val Gly Ala Thr Ser Asn Asp Val Thr Lys Leu Asn Tyr
65 70 75 80

Tyr Ile Val Asp Tyr Ala Pro Ser Lys Leu Thr Ala Ile Gly Asp Gly
85 90 95

Leu Lys Ala Thr Phe Ala Leu Asp Arg Leu Pro Pro Cys Thr Leu Val
100 105 110

Pro Val Ser Ala Leu Ser Ser Pro Glu Tyr Leu Phe Glu Val Asp Ala
115 120 125

Thr Ala Leu Val Pro Gly His Thr Thr Pro Asp Asn Val Ala Asp Val
130 135 140

Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val
145 150 155 160

Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val
165 170 175

Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile
180 185 190

Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ala Glu Val
195 200 205

Ser Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr
210 215 220

Thr Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala
225 230 235 240

Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala
245 250 255

Glu Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu His Ser Leu Gln
260 265 270

Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe
275 280 285

Ala His Tyr Cys Glu Lys Glu Leu Asn Leu Pro Ala Val Leu Gly Val
290 295 300

Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile
305 310 315 320

Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser
325 330 335

Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys
340 345 350

Thr Gly Met Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro
355 360 365

Gly Ser Val His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala
370 375 380

Ser Gly Cys Thr Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys
385 390 395 400

Lys Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe
405 410 415

Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn Ser Ile
420 425 430

Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp Lys Pro Trp Trp
435 440 445

Arg Glu Gln Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile
450 455 460

Ser Phe Ala Arg Asp Thr Ser Ile Asp Val Asp Arg Gln Trp Ser Ile
465 470 475 480

Thr Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser
485 490 495

Lys Gln Val Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr
500 505 510

Glu Asn Ala Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile
515 520 525

Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr
530 535 540

Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe
545 550 555 560

Lys Ser Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly
565 570 575

Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val
580 585 590

Val Ala Ser Leu Val Pro Ala Ala
595 600



<210> 34 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 37-mer oligonucleotide 



<400> 34 

ggggaattca tggcacttgc accgagctac atcaatc 



<210> 35 






<211> 1929 
<212> DNA 

<213> Exophiala spinifera
<220> 

<221> intron 

<222> (739) . . . (811) 

<221> intron 

<222> (1134) . . . (1186) 



<400> 35 

atggcacttg caccgagcta catcaatccc ccaaacgtcg cctccccagc agggtattcc 60 

cacatcggcg taggcccaaa cgaagcgagg tatgtgacaa tagctggaca gattggacaa 120 

gacgctttgg gcgtgacaga cccagcctac gagaaacagg ttgcccaagc attcgccaat 180 

ctgcgagctt gccttgctgc agttggagcc tcttcaaacg acgtcaccaa gctcaattac 240 

tacatcgtcg actacgcccc gagcaaactc accgcaattg gagatgggct gaagtctacc 300

tttgcccttg acaggctccc tccttgcacg ctggtgccag taccggcctt ggcttcacct 360 

gaatacctct ttgaggttga tgccacggcg ctggtgccag gacactcgac cccagacaac 420 

gttgcggacg tggtagtggt gggcgctggc ttgagcggtt tggagacggc acgcaaagtc 480 

caggccgccg gtctgtcctg cctcgttctt gaggcgatgg atcgtgtagg gggaaagact 540 

ctgagcgtac aatcgggtcc cggcaggacg actatcaacg acctcggcgc tgcgtggatc 600

aatgacagca accaaagcga agtatccaga ttgtttgaaa gatttcattt ggagggcgag 660 

ctccagagga cgaccggaaa ttcaatccat caagcacaag acggtacaac cactacagct 720

ccttatggtg actccccggt aagcacaatc ccactttgtg atgagacctc tgtcgagtgt 780 

agaatacagt cactgactcc acttcgtcca gctgagcgag gaggttgcaa gtgcacttgc 840 

ggaactcctc cccgtatggt ctcagctgat cgaagagtat agccttgaag accccaaggc 900 

gagccctcag gcgaagcggc tcgacagtgt gagcttcgcg cactactgtg agaaggacct 960 

aaacttgcct gctgttctca gcgtggcaaa ccagatcaca cgcgctctgc tcggtgtgga 1020 

agcccacgag atcagcatgc tttttctcac cgactacatc aagagtgcca ccggtctcag 1080 

taatattgtc tcggacaaga aagacggcgg gcagtatatg cgatgcaaaa caggtgcgtg 1140 

cggtgtcctc tcaggtaggg gactcgtttc ttagtggtca ttccaggtat gcagtcgatt 1200 

tgccatgcca tgtcaaagga acttgttcca ggctcagtgc acctcaacac ccccgtcgct 1260 

ggaattgagc agtcggcgtc cggctgtata gtacgatcgg cctcgggcgc cgtgttccga 1320 

agcaaaaagg tggtggtttc gttaccgaca acattgtatc ccaccttgac attttcacca 1380 

cctcttcccg ccgagaagca agcattggcg gaaaaatcta tcctcggcta ctatagcaag 1440

atagtcttcg tatgggacaa cccgtggtgg cgcgaacaag gcttctcggg cgtcctccaa 1500 

tcgagctgtg accccatctc atttgccaga gataccagca tcgaagtcga tcggcaatgg 1560 

tccattacct gtttcatggt cggagacccg ggacggaagt ggtcccaaca gtccaagcag 1620 

gtacgacaaa agtctgtctg ggaccaactc cgcgcagcct acgagaacgc cggggcccaa 1680

gtcccagagc cggccaacgt gctcgaaatc gagtggtcga agcagcagta tttccaagga 1740 

gctccgagcg ccgtctatgg gctgaacgat ctcatcacac tgggttcggc gctcagaacg 1800 

ccgttcaagt gtgttcattt cgttggaacg gagacgtctt tagtttggaa agggtatatg 1860 

gaaggggcca tacgatcggg tcaacgaggt gctgcagaag ttgtggctag cctggtgcca 1920 

gcagcatag 1929 



<210> 36 
<211> 600 
<212> PRT 

<213> Exophiala spinifera 
<400> 36 

Met Ala Leu Ala Pro Ser Tyr Ile Asn Pro Pro Asn Val Ala Ser Pro
1 5 10 15

Ala Gly Tyr Ser His Ile Gly Val Gly Pro Asn Glu Ala Arg Tyr Val
20 25 30

Thr Ile Ala Gly Gln Ile Gly Gln Asp Ala Leu Gly Val Thr Asp Pro
35 40 45

Ala Tyr Glu Lys Gln Val Ala Gln Ala Phe Ala Asn Leu Arg Ala Cys
50 55 60

Leu Ala Ala Val Gly Ala Ser Ser Asn Asp Val Thr Lys Leu Asn Tyr
65 70 75 80

Tyr Ile Val Asp Tyr Ala Pro Ser Lys Leu Thr Ala Ile Gly Asp Gly
85 90 95

Leu Lys Ser Thr Phe Ala Leu Asp Arg Leu Pro Pro Cys Thr Leu Val
100 105 110

Pro Val Pro Ala Leu Ala Ser Pro Glu Tyr Leu Phe Glu Val Asp Ala
115 120 125

Thr Ala Leu Val Pro Gly His Ser Thr Pro Asp Asn Val Ala Asp Val
130 135 140

Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val
145 150 155 160

Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val
165 170 175

Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile
180 185 190

Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val
195 200 205

Ser Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr
210 215 220

Thr Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala
225 230 235 240

Pro Tyr Gly Asp Ser Pro Leu Ser Glu Glu Val Ala Ser Ala Leu Ala
245 250 255

Glu Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu Tyr Ser Leu Glu
260 265 270

Asp Pro Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe
275 280 285

Ala His Tyr Cys Glu Lys Asp Leu Asn Leu Pro Ala Val Leu Ser Val
290 295 300

Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile
305 310 315 320

Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser
325 330 335

Asn Ile Val Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys
340 345 350

Thr Gly Met Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro
355 360 365

Gly Ser Val His Leu Asn Thr Pro Val Ala Gly Ile Glu Gln Ser Ala
370 375 380

Ser Gly Cys Ile Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys
385 390 395 400

Lys Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe
405 410 415

Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Lys Ser Ile
420 425 430

Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp Asn Pro Trp Trp
435 440 445

Arg Glu Gln Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile
450 455 460

Ser Phe Ala Arg Asp Thr Ser Ile Glu Val Asp Arg Gln Trp Ser Ile
465 470 475 480

Thr Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser
485 490 495

Lys Gln Val Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr
500 505 510

Glu Asn Ala Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile
515 520 525

Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr
530 535 540

Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe
545 550 555 560

Lys Cys Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly
565 570 575

Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val
580 585 590

Val Ala Ser Leu Val Pro Ala Ala
595 600



<210> 37 
<211> 1929 
<212> DNA 

<213> Exophiala spinifera 
<220> 

<221> intron 

<222> (739) . . . (811) 

<221> intron 

<222> (1134) . . . (1186) 



<400> 37 

atggcacttg caccgagcta catcaatccc ccaaacgtcg cctccccagc agggtattcc 60 

cacatcggcg taggcccaaa cgaagcgagg tatgtgacaa tagctggaca gattggacaa 120 

gacgctttgg gcgtgacaga cccagcctac gagaaacagg ttgcccaagc attcgccaat 180 

ctgcgagctt gccttgctgc agttggagcc tcttcaaacg acgtcaccaa gctcaattac 240 

tacatcgtcg actacgcccc gagcaaactc accgcaattg gagatgggct gaagtctacc 300 

tttgcccttg acaggctccc tccttgcacg ctggtgccag taccggcctt ggcttcacct 360 

gaatacctct ttgaggttga cgccacggcg ctggtgccag gacactcgac cccagacaac 420 

gttgcggacg tggtagtggt gggcgctggc ttgagcggct tggagacggc acgcaaagtc 480 

caggccgccg gtctgtcctg cctcgttctt gaggcgatgg atcgtgtagg gggaaagact 540 

ctgagcgtac aatcgggtcc cggcaggacg actatcaacg acctcggcgc tgcgtggatc 600 

aatgacagca accaaagcga agtatccaga ttgtttgaaa gatttcattt ggagggcgag 660 

ctccagagga cgaccggaaa ttcaatccat caagcacaag acggtacaac cactacagct 720 

ccttatggtg actccccggt aagcacaatc ccactttgtg atgagacctc tgtcgagtgt 780 

agaatacagt cactgactcc acttcgtcca gctgagcgag gaggttgcaa gtgcacttgc 840

ggaactcctc cccgtatggt ctcagctgat cgaagagtat agccttgaag accccaaggc 900

gagccctcag gcgaagcggc tcgacagtgt gagcttcgcg cactactgtg agaaggacct 960 

aaacttgcct gctgttctca gcgtggcaaa ccagatcaca cgcgctctgc tcggtgtgga 1020 

agcccacgag atcagcatgc tttttctcac cgactacatc aagagtgcca ccggtctcag 1080 

taatattgtc tcggacaaga aagacggcgg gcagtatatg cgatgcaaaa caggtgcgtg 1140 

cggtgtcctc tcaggtaggg gactcgtttc ttagtggtca ttccaggtat gcagtcgatt 1200 

tgccatgcca tgtcaaagga acttgttcca ggctcagtgc acctcaacac ccccgtcgct 1260

ggaattgagc agtcggcgtc cggctgtata gtacgatcgg cctcgggcgc cgtgttccga 1320

agcaaaaagg tggtggtttc gttaccgaca acattgtatc ccaccttgac attttcacca 1380

cctcttcccg ccgagaagca agcattggcg gaaaaatcta tcctcggcta ctatagcaag 1440

atagtcttcg tatgggacaa cccgtggtgg cgcgaacaag gcttctcggg cgtcctccaa 1500

tcgagctgtg accccatctc atttgccaga gataccagca tcgaagtcga tcggcaatgg 1560

tccattacct gtttcatggt cggagacccg ggacggaagt ggtcccaaca gtccaagcag 1620 

gtacgacaaa agtctgtctg ggaccaactc cgcgcagcct acgagaacgc cggggcccaa 1680

gtcccagagc cggccaacgt gctcgaaatc gagtggtcga agcagcagta tttccaagga 1740

gctccgagcg ccgtctatgg gctgaacgat ctcatcacac tgggttcggc gctcagaacg 1800 

ccgttcaagt gtgttcattt cgttggaacg gagacgtctt tagtttggaa agggtatatg 1860

gaaggggcca tacgatcggg tcaacgaggt gctgcagaag ttgtggctag cctggtgcca 1920

gcagcatag 1929 

<210> 38 
<211> 600 
<212> PRT 

<213> Exophiala spinifera 



<400> 38 
Met Ala Leu Ala Pro Ser Tyr Ile Asn Pro Pro Asn Val Ala Ser Pro
1 5 10 15

Ala Gly Tyr Ser His Ile Gly Val Gly Pro Asn Glu Ala Arg Tyr Val
20 25 30

Thr Ile Ala Gly Gln Ile Gly Gln Asp Ala Leu Gly Val Thr Asp Pro
35 40 45

Ala Tyr Glu Lys Gln Val Ala Gln Ala Phe Ala Asn Leu Arg Ala Cys
50 55 60

Leu Ala Ala Val Gly Ala Ser Ser Asn Asp Val Thr Lys Leu Asn Tyr
65 70 75 80

Tyr Ile Val Asp Tyr Ala Pro Ser Lys Leu Thr Ala Ile Gly Asp Gly
85 90 95

Leu Lys Ser Thr Phe Ala Leu Asp Arg Leu Pro Pro Cys Thr Leu Val
100 105 110

Pro Val Pro Ala Leu Ala Ser Pro Glu Tyr Leu Phe Glu Val Asp Ala
115 120 125

Thr Ala Leu Val Pro Gly His Ser Thr Pro Asp Asn Val Ala Asp Val
130 135 140

Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val
145 150 155 160

Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val
165 170 175

Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile
180 185 190

Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val
195 200 205

Ser Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr
210 215 220

Thr Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala
225 230 235 240

Pro Tyr Gly Asp Ser Pro Leu Ser Glu Glu Val Ala Ser Ala Leu Ala
245 250 255

Glu Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu Tyr Ser Leu Glu
260 265 270

Asp Pro Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe
275 280 285

Ala His Tyr Cys Glu Lys Asp Leu Asn Leu Pro Ala Val Leu Ser Val
290 295 300

Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile
305 310 315 320

Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser
325 330 335

Asn Ile Val Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys
340 345 350

Thr Gly Met Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro
355 360 365

Gly Ser Val His Leu Asn Thr Pro Val Ala Gly Ile Glu Gln Ser Ala
370 375 380

Ser Gly Cys Ile Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys
385 390 395 400

Lys Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe
405 410 415

Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Lys Ser Ile
420 425 430

Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp Asn Pro Trp Trp
435 440 445

Arg Glu Gln Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile
450 455 460

Ser Phe Ala Arg Asp Thr Ser Ile Glu Val Asp Arg Gln Trp Ser Ile
465 470 475 480

Thr Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser
485 490 495

Lys Gln Val Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr
500 505 510

Glu Asn Ala Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile
515 520 525

Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr
530 535 540

Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe
545 550 555 560

Lys Cys Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly
565 570 575

Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val
580 585 590

Val Ala Ser Leu Val Pro Ala Ala
595 600


<210> 39 
<211> 1930 
<212> DNA 

<213> Exophiala spinifera 
<220> 

<221> intron 

<222> (739) . . . (811) 

<221> intron 

<222> (1134) . . . (1187) 

<221> misc_feature
<222> (648) . . . (648) 
<223> n = A,T,C or G 



<400> 39 

atggcacttg caccgagcta catcaatccc ccaaacgtcg cctccccagc agggtattct 60 

cacgtcggcg taggcccaga cggagggagg tatgtgacaa tagctggaca gattggacaa 120

gacgcttcgg gcgtgacaga ccctgcctac gagaaacagg ttgcccaagc attcgccaat 180 

ctgcgagctt gccttgctgc agttggagcc acttcaaacg acgtcaccaa gctcaattac 240 

tacatcgtcg actacgcccc gagcaaactc accgcaattg gagatgggct gaaggctacc 300 

tttgcccttg acaggctccc tccttgcacg ctggtgccag tgtcggcctt gtcttcacct 360

gaatacctct ttgaggttga tgccacggcg ctggtgccgg gacacacgac cccagacaac 420

gttgcggacg tggtagtggt gggcgctggc ttgagcggtt tggagacggc acgcaaagtc 480 

caggccgccg gtctgtcctg cctcgttctt gaggcgatgg atcgtgtagg gggaaagact 540 

ctgagcgtac aatcgggtcc cggcaggacg actatcaacg acctcggcgc tgcgtggatc 600 

aatgacagca accaaagcga agtatccaga ttgtttgaaa gatttcatnt ggagggcgag 660

ctccagagga cgactggaaa ttcaatccat caagcacaag acggtacaac cactacagct 720 

ccttatggtg actccttggt aagcacaatc ccactttgtg atgagacctc tgtcgagtgt 780 

agaatacagt cactgattcc acttcgtcca gctgagcgag gaggttgcaa gtgcacttgc 840

ggaactcctc cccgtatggt ctcagctgat cgaagagcat agccttcaag acctcaaggc 900 

gagccctcag gcgaagcggc tcgacagtgt gagcttcgcg cactactgtg agaaggaact 960

aaacttgcct gctgttctcg gcgtagcaaa ccagatcaca cgcgctctgc tcggtgtgga 1020 

agcccacgag atcagcatgc tttttctcac cgactacatc aagagtgcca ccggtctcag 1080 

taatattttc tcggacaaga aagacggcgg gcagtatatg cgatgcaaaa caggtgcgtg 1140 

tggtgtcgtc tcaggtgggg gactcgtttc tcaagtggtc atttcaggta tgcagtcgat 1200 

ttgccatgcc atgtcaaagg aacttgttcc aggctcagtg cacctcaaca cccccgtcgc 1260 

tgaaattgag cagtcggcat ccggctgtac agtacgatcg gcctcgggcg ccgtgttccg 1320 

aagcaaaaag gtggtggttt cgttaccgac aaccttgtat cccaccttga cattttcacc 1380 

acctctcccc gccgagaagc aagcattggc ggaaaattct atcctgggct actatagcaa 1440 

gatagtcttc gtatgggaca agccgtggtg gcgcgaacaa ggcttctcgg gcgtcctcca 1500 

atcgagctgt gaccccatct catttgccag agataccagc atcgacgtcg atcgacaatg 1560 

gtccattacc tgtttcatgg tcggagaccc gggacggaag tggtcccaac agtccaagca 1620 

ggtacgacaa aagtctgtct gggaccaact ccgcgcagcc tacgagaacg ccggggccca 1680 

agtcccagag ccggccaacg tgctcgaaat cgagtggtcg aagcagcagt atttccaagg 1740

agctccgagc gccgtctatg ggctgaacga tctcatcaca ctgggttcgg cgctcagaac 1800 

gccgttcaag agtgttcatt tcgttggaac ggagacgtct ttagtttgga aagggtatat 1860 

ggaaggggcc atacgatcgg gtcaacgagg tgctgcagaa gttgtggcta gcctggtgcc 1920 

agcagcatag 1930 






<210> 40 
<211> 598 
<212> PRT 

<213> Exophiala spinifera 
<220> 

<221> VARIANT 

<222> (216) . . . (216)

<223> Xaa = Any Amino Acid



<400> 40 
Met Ala Leu Ala Pro Ser Tyr Ile Asn Pro Pro Asn Val Ala Ser Pro
1 5 10 15

Ala Gly Tyr Ser His Val Gly Val Gly Pro Asp Gly Gly Arg Tyr Val
20 25 30

Thr Ile Ala Gly Gln Ile Gly Gln Asp Ala Ser Gly Val Thr Asp Pro
35 40 45

Ala Tyr Glu Lys Gln Val Ala Gln Ala Phe Ala Asn Leu Arg Ala Cys
50 55 60

Leu Ala Ala Val Gly Ala Thr Ser Asn Asp Val Thr Lys Leu Asn Tyr
65 70 75 80

Tyr Ile Val Asp Tyr Ala Pro Ser Lys Leu Thr Ala Ile Gly Asp Gly
85 90 95

Leu Lys Ala Thr Phe Ala Leu Asp Arg Leu Pro Pro Cys Thr Leu Val
100 105 110

Pro Val Ser Ala Leu Ser Ser Pro Glu Tyr Leu Phe Glu Val Asp Ala
115 120 125

Thr Ala Leu Val Pro Gly His Thr Thr Pro Asp Asn Val Ala Asp Val
130 135 140

Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val Gln
145 150 155 160

Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val Gly
165 170 175

Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile Asn
180 185 190

Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val Ser
195 200 205

Arg Leu Phe Glu Arg Phe His Xaa Glu Gly Glu Leu Gln Arg Thr Thr
210 215 220

Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala Pro
225 230 235 240

Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala Glu
245 250 255

Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu His Ser Leu Gln Asp
260 265 270

Leu Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe Ala
275 280 285

His Tyr Cys Glu Lys Glu Leu Asn Leu Pro Ala Val Leu Gly Val Asn
290 295 300

Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile Ser Met
305 310 315 320

Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser Asn Ile
325 330 335

Phe Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys Thr Gly
340 345 350

Met Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro Gly Ser
355 360 365

Val His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala Ser Gly
370 375 380

Cys Thr Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys Lys Val
385 390 395 400

Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe Ser Pro
405 410 415

Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn Ser Ile Leu Gly
420 425 430

Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp Lys Pro Trp Trp Arg Glu
435 440 445

Gln Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile Ser Phe
450 455 460

Ala Arg Asp Thr Ser Ile Asp Val Asp Arg Gln Trp Ser Ile Thr Cys
465 470 475 480

Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser Lys Gln
485 490 495

Val Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr Glu Asn
500 505 510

Ala Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile Glu Trp
515 520 525

Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr Gly Leu
530 535 540

Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe Lys Ser
545 550 555 560

Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly Tyr Met
565 570 575

Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val Val Ala
580 585 590

Ser Leu Val Pro Ala Ala
595

<210> 41 
<211> 1928 
<212> DNA 

<213> Rhinocladiella atrovirens 
<220> 

<221> intron 

<222> (739) . . . (811) 

<221> intron 

<222> (1134) . . . (1185) 



<400> 41 

atggcacttg caccgagcta catcaatccc ccaaacctcg cctccccagc agggtattcc 60

cacgtcggcg taggcccaaa cggagggagg tatgcgacaa tagctggaca gattggacaa 120

gacgcttcgg ccgtgacaga ccctgcctac gagaaacagg ttgcccaagc attcgccaac 180 

ctgcgagctt gtcttgctgc agttggagcc acttcaaacg acattaccaa gctcaattac 240 

tacatcgtcg actacaaccc gagcaaactc accgcaattg gagatgggct gaaggctacc 300

tttgcccttg acaggctccc tccttgcacg ctggtgccag tgccggccct ggcttcacct 360

gaatacccct ttgaggttga tgccacggcg ctggttccag gacactcaac cccagacaat 420 

gttgcggacg tggtcgtggt gggcgctggc ttgagcggtt tggagacggc acgcaaagtc 480

caggctgccg ggctgtcctg cctcgttctt gaggcgatgg atcgtgtggg gggaaagact 540

ctgagcgtac aatcgggtcc cggcaggacg gctatcaatg acctcggcgc tgcgtggatc 600 

aatgacagca accaaagcga agtattcaaa ttatttgaaa gatttcattt ggagggcgag 660

ctccagagga cgaccggaaa ttcaatccat caagcacaag acggtacaac cactacagct 720 

ccttatggtg attccctggt aagcacaatt ccatcttgtg atgagacctc tgtcgtgtgt 780 

agaatacagt cgctgactcc acatcgtcca gctgagcgag gaggttgcaa gtgcactcgc 840 

ggaactcctt cccgcatggt ctcagctgat cgaagagcat agtcttgaag accccaaggc 900 

gagccctcaa gcgaagcagc tcgacagtgt gagcttcgca cactactgtg agaaggatct 960 

aagcttgcct gctgttctcg gcgtggcaaa ccagatcaca cgcgctctgc tcggtgtgga 1020 

agcccacgag atcagcatgc tttttctcac cgactacatc aagagtgcca ccggtctcag 1080 

taatattgtc tcggataaga aagacggtgg gcagtatatg cgatgcaaaa caggtgcgtg 1140 

tggtgttctc tcagtgggag actcgtttct tagtggtcat tccaggtatg cagtcgcttt 1200

gccatgccat gtcaaaggaa cttgttccag gctcagtgca cctcaacacc cccgtcgccg 1260

aaattgagca gtcggcatcc ggctgtacag tacgatcggc ctcgggcggc gtgttccgaa 1320

gtaaaaaggt ggtggtttcg ttaccgacaa ccttgtatcc caccttgata ttttcaccac 1380

ctcttcccgc cgagaagcaa gcattggctg aaaaatccat cctgggctac tatagcaaga 1440

tagtcttcgt atgggacaag ccgtggtggc gcgaacaagg cttctcgggc gtcctccaat 1500 

cgagctgtga ccccatctca tttgccagag ataccagcat cgaagtcgat cggcaatggt 1560 

ccattacctg tttcatggtc ggagacccgg gacggaagtg gtcccaacag tccaagcagg 1620 

tacgacagaa gtctgtctgg aaccaactcc gcgcagccta cgagaacgcc ggggcccaag 1680 

tcccagagcc ggccaacgtg ctcgagatcg agtggtcgaa gcagcagtat ttccaaggag 1740

cgccgagcgt cgtctatggg ctgaactgtc tcaacacact gggttcggcg ctcagaacgc 1800 

cgttcaaggg tgttcatttc gttggaacgg agacgtcttt ggtttggaaa gggtatatgg 1860 

aaggggccat acgatcgggt cagcgaggcg ctgcagaagt tgtggctagc ctggtgccag 1920 

cagcatag 1928 



<210> 42 
<211> 598 
<212> PRT 

<213> Rhinocladiella atrovirens 



<400> 42 
Met Ala Leu Ala Pro Ser Tyr Ile Asn Pro Pro Asn Leu Ala Ser Pro
1 5 10 15

Ala Gly Tyr Ser His Val Gly Val Gly Pro Asn Gly Gly Arg Tyr Ala
20 25 30

Thr Ile Ala Gly Gln Ile Gly Gln Asp Ala Ser Ala Val Thr Asp Pro
35 40 45

Ala Tyr Glu Lys Gln Val Ala Gln Ala Phe Ala Asn Leu Arg Ala Cys
50 55 60

Leu Ala Ala Val Gly Ala Thr Ser Asn Asp Ile Thr Lys Leu Asn Tyr
65 70 75 80

Tyr Ile Val Asp Tyr Asn Pro Ser Lys Leu Thr Ala Ile Gly Asp Gly
85 90 95

Leu Lys Ala Thr Phe Ala Leu Asp Arg Leu Pro Pro Cys Thr Leu Val
100 105 110

Pro Val Pro Ala Leu Ala Ser Pro Glu Tyr Pro Phe Glu Val Asp Ala
115 120 125

Thr Ala Leu Val Pro Gly His Ser Thr Pro Asp Asn Val Ala Asp Val
130 135 140

Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val
145 150 155 160

Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val
165 170 175

Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Ala Ile
180 185 190

Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val
195 200 205

Phe Lys Leu Phe Glu Arg Leu Glu Gly Glu Leu Gln Arg Thr Thr Gly
210 215 220

Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala Pro Tyr
225 230 235 240

Gly Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala Glu Leu
245 250 255

Leu Pro Ala Trp Ser Gln Leu Ile Glu Glu His Ser Leu Glu Asp Pro
260 265 270

Lys Ala Ser Pro Gln Ala Lys Gln Leu Asp Ser Val Ser Phe Ala His
275 280 285

Tyr Cys Glu Lys Asp Leu Ser Leu Pro Ala Val Leu Gly Val Ala Asn
290 295 300

Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile Ser Met
305 310 315 320

Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser Asn Ile
325 330 335

Val Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys Thr Gly
340 345 350

Met Gln Ser Leu Cys His Ala Met Ser Lys Glu Leu Val Pro Gly Ser
355 360 365

Val His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala Ser Gly
370 375 380

Cys Thr Val Arg Ser Ala Ser Gly Gly Val Phe Arg Ser Lys Lys Val
385 390 395 400

Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Ile Phe Ser Pro
405 410 415

Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Lys Ser Ile Leu Gly
420 425 430

Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp Lys Pro Trp Trp Arg Glu
435 440 445

Gln Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile Ser Phe
450 455 460

Ala Arg Asp Thr Ser Ile Glu Val Asp Arg Gln Trp Ser Ile Thr Cys
465 470 475 480

Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser Lys Gln
485 490 495

Val Arg Gln Lys Ser Val Trp Asn Gln Leu Arg Ala Ala Tyr Glu Asn
500 505 510

Ala Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile Glu Trp
515 520 525

Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Val Val Tyr Gly Leu
530 535 540

Asn Cys Leu Asn Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe Lys Gly
545 550 555 560

Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly Tyr Met
565 570 575

Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val Val Ala
580 585 590

Ser Leu Val Pro Ala Ala
595


<210> 43 

<211> 1928 

<212> DNA 

<213> Rhinocladiella atrovirens 



<220> 

<221> intron 

<222> (739) . . . (811) 

<221> intron 

<222> (1134) . . . (1186) 



<400> 43 

atggcacttg caccgagcta catcaatccc ccaaacctcg cctccccagc agggtattcc 60 

tacgtcggcg taggcccaaa cggagggagg tatgtgacaa tagctggaca gattggacaa 120 

gacgcttcgg ccgtgacaga ccctgcctac gagaaacagg ttgcccaagc attcgccaac 180

ctgcgagctt gtcttgctgc agttggagcc acttcaaacg acattaccaa gctcaattac 240

tacatcgtcg actacaaccc gagcaaactc accgcaattg gagatgggct gaaggctacc 300

tttgcccttg acaggctccc tccttgcacg ctggtgccag tgccggccct ggcttcacct 360

gaatacctct ttgaggttga tgccacggcg ctggttccag gacactcaac cccagacaat 420 

gttgcggacg tggtcgtggt gggcgctggc ttgagcggtt tggagacggc acgcaaagtc 480 

caggctgccg ggctgtcctg cctcgttctt gaggcgatgg atcgtgtggg gggaaagact 540

ctgagcgtac aatcgggtcc cggcaggacg actatcaatg acctcggcgc tgcgtggatc 600 

aatgacagca accaaagcga agtattcaaa ttatttgaaa gatttcattt ggagggcgag 660

ctccagagga cgaccggaaa ttcaatccat caagcacaag acggtacaac cactacagct 720 

ccttatggtg attccctggt aagcacaatt ccatcttgtg atgagacctc tgtcgtgtgt 780 

agaatacagt cgctgactcc acatcgtcca gctgagcgag gaggttgcaa gtgcactcgc 840

ggaactcctt cccgcatggt ctcagctgat cgaagagcat agtcttgaag accccaaggc 900 

gagccctcaa gcgaagcagc tcgacagtgt gagcttcgca cactactgtg agaaggatct 960 

aaacttgcct gctgttctcg gcgtggcaaa ccagatcaca cgcgctctgc tcggtgtgga 1020

agcccacgag atcagcatgt tttttctcac cgactacatc aagagtgcca ccggtctcag 1080

taatattgtc tcggataaga aagacggtgg gcagtatatg cgatgcaaaa caggtgcgtg 1140

tggtgttctc tcagtgggag actcgtttct tagtggtcat tccaggtatg cagtcgcttt 1200

gccatgccat gtcaaaggaa cttgttccag gctcagtgca cctcaacacc cccgtcgccg 1260

aaattgagca gtcggcatcc ggctgtacag tacgatcggc ctcgggcggc gtgttccgaa 1320

gtaaaaaggt ggtggtttcg ttaccgacaa ccttgtatcc caccttgata ttttcaccac 1380

ctcttcccgc cgagaagcaa gcattggctg aaaaatccat cctgggctac tatagcaaga 1440

tagtcttcgt atgggacaag ccgtggtggc gcgaacaagg cttctcgggc gtcctccaat 1500

cgagctgtga ccccatctca tttgccagag ataccagcat cgaagtcgat cggcaatggt 1560

ccattacctg tttcatggtc ggagacccgg gacggaagtg gtcccaacag tccaagcagg 1620

tacgacagaa gtctgtctgg aaccaactcc gcgcagccta cgagaacgcc ggggcccaag 1680

tcccagagcc ggccaacgtg ctcgagatcg agtggtcgaa gcagcagtat ttccaaggag 1740

cgccgagcgc cgtctatggg ctgaactgtc tcaacacact gggttcggcg ctcagaacgc 1800

cgttcaaggg tgttcatttc gttggaacgg agacgtcttt ggtttggaaa gggtatatgg 1860

aaggggccat acgatcgggt cagcgaggcg ctgcagaagt tgtggctagc ctggtgccag 1920

cagcatag 1928

<210> 44 
<211> 591 
<212> PRT 

<213> Rhinocladiella atrovirens 
<400> 44 

Met Ala Leu Ala Pro Ser Tyr Ile Asn Pro Pro Asn Leu Ala Ser Pro
1 5 10 15

Ala Gly Tyr Ser Tyr Val Gly Val Gly Pro Asn Gly Gly Arg Tyr Val
20 25 30

Thr Ile Ala Gly Gln Ile Gly Gln Asp Ala Ser Ala Val Thr Asp Pro
35 40 45

Ala Tyr Glu Lys Gln Val Ala Gln Ala Phe Ala Asn Leu Arg Ala Cys
50 55 60

Leu Ala Ala Val Gly Ala Thr Ser Asn Asp Ile Thr Lys Leu Asn Tyr
65 70 75 80

Tyr Ile Val Asp Tyr Asn Pro Ser Lys Leu Thr Ala Ile Gly Asp Gly
85 90 95

Leu Lys Ala Thr Phe Ala Leu Asp Arg Leu Pro Pro Cys Thr Leu Val
100 105 110

Pro Val Pro Ala Leu Ala Ser Pro Glu Tyr Leu Phe Glu Val Asp Ala
115 120 125

Thr Ala Leu Val Pro Gly His Ser Thr Pro Asp Asn Val Ala Asp Val
130 135 140

Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val
145 150 155 160

Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val
165 170 175

Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Gly Arg Thr Thr Ile Asn
180 185 190

Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val Lys
195 200 205

Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr Thr Gly
210 215 220

Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala Pro Tyr
225 230 235 240

Gly Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala Glu Leu Leu
245 250 255

Pro Ala Ser Gln Leu Ile Glu Glu His Ser Leu Glu Asp Pro Lys Ala
260 265 270

Ser Pro Gln Ala Lys Gln Leu Asp Ser Val Ser Phe Ala His Tyr Cys
275 280 285

Glu Lys Leu Asn Leu Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg
290 295 300

Ala Leu Leu Gly Val Glu Ala His Glu Ile Ser Met Phe Phe Leu Thr
305 310 315 320

Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser Asn Ile Val Ser Asp Lys
325 330 335

Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys Thr Gly Met Gln Ser Leu
340 345 350

Cys His Ala Met Ser Lys Glu Leu Val Pro Gly Ser Val His Leu Asn
355 360 365

Thr Pro Val Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg
370 375 380

Ser Ala Ser Gly Gly Val Phe Arg Ser Lys Lys Val Val Leu Pro Thr
385 390 395 400

Leu Tyr Pro Thr Leu Ile Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln
405 410 415

Ala Leu Ala Glu Lys Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe
420 425 430

Val Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu
435 440 445

Gln Ser Ser Cys Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Glu
450 455 460

Val Asp Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly
465 470 475 480

Arg Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp
485 490 495

Asn Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu
500 505 510

Pro Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln
515 520 525

Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn Cys Leu Asn Thr Leu Gly
530 535 540

Ser Ala Leu Arg Thr Pro Phe Lys Gly Val His Phe Val Gly Thr Glu
545 550 555 560

Thr Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly
565 570 575

Gln Arg Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala
580 585 590



<210> 45 
<211> 1928 
<212> DNA 

<213> Rhinocladiella atrovirens 



<220> 

<221> intron 

<222> (739) . . . (811) 

<221> intron 

<222> (1134) . . . (1185) 



<400> 45 

atggcacttg caccgagcta catcaatccc ccaaacctcg cctccccagc agggtattcc 60

cacgtcggcg taggcccaaa cggagggagg tatgtgacaa tagctggaca gattggacaa 120

gacgcttcgg ccgtgacaga ccctgcctac gagaaacagg ttgcccaagc attcgccaac 180

ctgcgagctt gtcttgctgc agttggagcc acttcaaacg acattaccaa gctcaattac 240

tacatcgtcg actacaaccc gagcaaactc accgcaattg gagatgggct gaaggctacc 300

tttgcccttg acaggctccc tccttgcacg ctggtgccag tgccggccct ggcttcacct 360 

gaatacctct ttgaggttga tgctacggcg ctggttccag gacactcaac cccagacaat 420 

gttgcggacg tggtcgtggt gggcgctggc ttgagcggtt tggagacggc acgcaaagtc 480 

caggctgccg ggctgtcctg cctcgttctt gaggcgatgg atcgtgtggg gggaaagact 54 0 

ctgagcgtac aatcgggtcc cggcaggacg actatcaatg acctcggcgc tgcgtggatc 6 00 

aatgacagca accaaagcga agtattcaaa ttatttgaaa gatttcattt ggagggcgag 660 

ctccagagga cgaccggaaa ttcaatccat caagcacaag acggtacaac cactacagct 72 0 

ccttatggtg attccctggt aggcacaatt ccatcttgtg atgagacctc tgtcgtgtgt 780 
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agaatacagt cgctgactcc acatcgtcca gctgagcgag gaggttgcaa gtgcactcgc 840 

ggaactcctt cccgcatggt ctcagctgat cgaagagcat agtcttgaag accccaaggc 900 

gagccctcaa gcgaagcagc tcgacagtgt gagcttcgca cactactgtg agaaggatct 960 

aaacttgcct gctgttctcg gcgtggcaaa ccagatcaca cgcgctctgc tcggtgtgga 102 0 

agcccacgag atcagcatgc tttttctcac cgactacatc aagagtgcca ccggtctcag 1080 

taatattgtc tcggataaga aagacggtgg gcagtatatg cgatgcaaaa caggtgcgtg 1140 

tggtgttctc tcagtgggag actcgtttct tagtggtcat tccaggtatg cagtcgcttt 1200 

gccatgccat gtcaaaggaa cttgttccag gctcagtgca cctcaacacc cccgtcgccg 126 0 

aaattgagca gtcggcatcc ggctgtacag tacgatcggc ctcgggcggc gtgttccgaa 132 0 

gtaaaaaggt ggtggtttcg ttaccgacaa ccttgtatcc caccttgata ttttcaccac 1380 

ctcttcccgc cgagaagcaa gcattggctg aaaaatccat cctgggctac tatagcaaga 1440 

tagtcttcgt atgggacaag ctgtggtggc gcgaacaagg cttctcgggc gtcctccaat 1500 

cgagctgtga ccccatctca tttgccagag ataccagcat cgaagtcgat cggcaatggt 1560 

ccattacctg tttcatggtc ggagacccgg gacggaagtg gtcccaacag tccaagcagg 162 0 

tacgacagaa gtctgtctgg aaccaactcc gcgcagccta cgagaacgcc ggggcccaag 1680 

tcccagagcc ggccaacgtg ctcgagatcg agtggtcgaa gcagcagtat ttccaaggag 1740 

cgccgagcgc cgtctatggg ctgaactgtc tcaacacact gggttcggcg ctcagaacgc 18 0 0 

cgttcaaggg tgttcatttc gttggaacgg agacgtcttt ggtttggaaa gggtatatgg 1860 

aaggggccat acgatcgggt cagcgaggcg ctgcagaagt tgtgcctagc ctggtgccag 1920 

, 1928 
cagcatag 

<210> 46 
<211> 591 
<212> PRT 

<213> Rhinocladiella atrovirens 
<400> 46 

Met Ala Leu Ala Pro Ser Tyr Ile Asn Pro Pro Asn Ala Ser Pro Ala 
1 5 10 15 

Gly Tyr Ser His Val Gly Val Gly Pro Asn Gly Gly Arg Tyr Val Thr 
20 25 30 

Ile Ala Gly Gln Ile Gly Gln Asp Ala Ser Ala Val Thr Asp Pro Ala 
35 40 45 

Tyr Glu Lys Gln Val Ala Gln Ala Phe Ala Asn Leu Arg Ala Cys Leu 
50 55 60 

Ala Ala Val Gly Ala Thr Ser Asn Asp Ile Thr Lys Leu Asn Tyr Tyr 
65 70 75 80 

Ile Val Asp Tyr Asn Pro Ser Lys Leu Thr Ala Ile Gly Asp Gly Leu 
85 90 95 

Lys Ala Thr Phe Ala Leu Asp Arg Leu Pro Pro Cys Thr Leu Val Pro 
100 105 110 

Val Pro Ala Leu Ala Ser Pro Glu Tyr Leu Phe Glu Val Asp Ala Thr 
115 120 125 

Ala Leu Val Pro Gly His Ser Thr Pro Asp Asn Val Ala Asp Val Val 
130 135 140 

Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val Gln 
145 150 155 160 

Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val Gly 
165 170 175 

Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile Asn 
180 185 190 

Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val Phe 
195 200 205 

Lys Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr Thr 
210 215 220 

Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala Pro 
225 230 235 240 

Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala Glu 
245 250 255 

Leu Leu Pro Ala Trp Ser Gln Leu Ile Glu Glu His Ser Leu Glu Asp 
260 265 270 

Pro Lys Ala Ser Pro Gln Ala Lys Gln Leu Asp Ser Val Ser Phe Ala 
275 280 285 

His Tyr Cys Glu Lys Asp Leu Asn Leu Pro Ala Val Leu Gly Val Ala 
290 295 300 

Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile Ser 
305 310 315 320 

Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser Asn 
325 330 335 

Ile Val Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys Thr 
340 345 350 

Gly Met Gln Ser Leu Cys His Ala Met Ser Lys Glu Leu Val Pro Gly 
355 360 365 

Ser Val His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala Ser 
370 375 380 

Gly Cys Thr Val Arg Ser Ala Ser Gly Gly Val Phe Arg Ser Lys Lys 
385 390 395 400 

Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Ile Phe Ser Pro Leu 
405 410 415 

Pro Ala Glu Lys Gln Ala Leu Ala Glu Lys Ser Ile Gly Tyr Tyr Ser 
420 425 430 

Lys Ile Val Phe Val Asp Lys Leu Trp Trp Arg Glu Gln Gly Phe Ser 
435 440 445 

Gly Val Leu Gln Ser Ser Cys Asp Pro Ile Ser Phe Ala Arg Asp Thr 
450 455 460 

Ser Ile Glu Val Asp Arg Gln Ser Ile Thr Cys Phe Met Val Gly Asp 
465 470 475 480 

Pro Arg Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val 
485 490 495 

Trp Asn Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro 
500 505 510 

Glu Pro Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe 
515 520 525 

Gln Ala Pro Ser Ala Val Tyr Gly Leu Asn Cys Leu Asn Thr Leu Gly 
530 535 540 

Ser Ala Leu Arg Thr Pro Phe Lys Gly Val His Phe Val Gly Thr Glu 
545 550 555 560 

Thr Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly 
565 570 575 

Gln Arg Gly Ala Ala Glu Val Val Pro Ser Leu Val Pro Ala Ala 
580 585 590 



<210> 47 
<211> 600 
<212> PRT 

<213> Exophiala spinifera 



<400> 47 
Met Ala Leu Ala Pro Ser Tyr Ile Asn Pro Pro Asn Val Ala Ser Pro 
1 5 10 15 

Ala Gly Tyr Ser His Val Gly Val Gly Pro Asp Gly Gly Arg Tyr Val 
20 25 30 

Thr Ile Ala Gly Gln Ile Gly Gln Asp Ala Ser Gly Val Thr Asp Pro 
35 40 45 

Ala Tyr Glu Lys Gln Val Ala Gln Ala Phe Ala Asn Leu Arg Ala Cys 
50 55 60 

Leu Ala Ala Val Gly Ala Thr Ser Asn Asp Val Thr Lys Leu Asn Tyr 
65 70 75 80 

Tyr Ile Val Asp Tyr Ala Pro Ser Lys Leu Thr Ala Ile Gly Asp Gly 
85 90 95 

Leu Lys Ala Thr Phe Ala Leu Asp Arg Leu Pro Pro Cys Thr Leu Val 
100 105 110 

Pro Val Ser Ala Leu Ser Ser Pro Glu Tyr Leu Phe Glu Val Asp Ala 
115 120 125 

Thr Ala Leu Val Pro Gly His Thr Thr Pro Asp Asn Val Ala Asp Val 
130 135 140 

Val Val Val Gly Ala Gly Leu Ser Gly Leu Glu Thr Ala Arg Lys Val 
145 150 155 160 

Gln Ala Ala Gly Leu Ser Cys Leu Val Leu Glu Ala Met Asp Arg Val 
165 170 175 

Gly Gly Lys Thr Leu Ser Val Gln Ser Gly Pro Gly Arg Thr Thr Ile 
180 185 190 

Asn Asp Leu Gly Ala Ala Trp Ile Asn Asp Ser Asn Gln Ser Glu Val 
195 200 205 

Ser Arg Leu Phe Glu Arg Phe His Leu Glu Gly Glu Leu Gln Arg Thr 
210 215 220 

Thr Gly Asn Ser Ile His Gln Ala Gln Asp Gly Thr Thr Thr Thr Ala 
225 230 235 240 

Pro Tyr Gly Asp Ser Leu Leu Ser Glu Glu Val Ala Ser Ala Leu Ala 
245 250 255 

Glu Leu Leu Pro Val Trp Ser Gln Leu Ile Glu Glu His Ser Leu Gln 
260 265 270 

Asp Leu Lys Ala Ser Pro Gln Ala Lys Arg Leu Asp Ser Val Ser Phe 
275 280 285 

Ala His Tyr Cys Glu Lys Glu Leu Asn Leu Pro Ala Val Leu Gly Val 
290 295 300 

Ala Asn Gln Ile Thr Arg Ala Leu Leu Gly Val Glu Ala His Glu Ile 
305 310 315 320 

Ser Met Leu Phe Leu Thr Asp Tyr Ile Lys Ser Ala Thr Gly Leu Ser 
325 330 335 

Asn Ile Phe Ser Asp Lys Lys Asp Gly Gly Gln Tyr Met Arg Cys Lys 
340 345 350 

Thr Gly Met Gln Ser Ile Cys His Ala Met Ser Lys Glu Leu Val Pro 
355 360 365 

Gly Ser Val His Leu Asn Thr Pro Val Ala Glu Ile Glu Gln Ser Ala 
370 375 380 

Ser Gly Cys Thr Val Arg Ser Ala Ser Gly Ala Val Phe Arg Ser Lys 
385 390 395 400 

Lys Val Val Val Ser Leu Pro Thr Thr Leu Tyr Pro Thr Leu Thr Phe 
405 410 415 

Ser Pro Pro Leu Pro Ala Glu Lys Gln Ala Leu Ala Glu Asn Ser Ile 
420 425 430 

Leu Gly Tyr Tyr Ser Lys Ile Val Phe Val Trp Asp Lys Pro Trp Trp 
435 440 445 

Arg Glu Gln Gly Phe Ser Gly Val Leu Gln Ser Ser Cys Asp Pro Ile 
450 455 460 

Ser Phe Ala Arg Asp Thr Ser Ile Asp Val Asp Arg Gln Trp Ser Ile 
465 470 475 480 

Thr Cys Phe Met Val Gly Asp Pro Gly Arg Lys Trp Ser Gln Gln Ser 
485 490 495 

Lys Gln Val Arg Gln Lys Ser Val Trp Asp Gln Leu Arg Ala Ala Tyr 
500 505 510 

Glu Asn Ala Gly Ala Gln Val Pro Glu Pro Ala Asn Val Leu Glu Ile 
515 520 525 

Glu Trp Ser Lys Gln Gln Tyr Phe Gln Gly Ala Pro Ser Ala Val Tyr 
530 535 540 

Gly Leu Asn Asp Leu Ile Thr Leu Gly Ser Ala Leu Arg Thr Pro Phe 
545 550 555 560 

Lys Ser Val His Phe Val Gly Thr Glu Thr Ser Leu Val Trp Lys Gly 
565 570 575 

Tyr Met Glu Gly Ala Ile Arg Ser Gly Gln Arg Gly Ala Ala Glu Val 
580 585 590 

Val Ala Ser Leu Val Pro Ala Ala 
595 600 
590 



<210> 48 



<211> 1392 
<212> DNA 
<213> Unknown 



<220> 
<221> CDS 
<222> (1)...(1392) 
<223> Cys (-) APAO; removal of cysteine 461 



<400> 48 
aaa gac aac gtt gcg gac gtg gta gtg gtg ggc gct ggc ttg agc ggt 48 
Lys Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly 
1 5 10 15 

ttg gag acg gca cgc aaa gtc cag gcc gcc ggt ctg tcc tgc ctc gtt 96 
Leu Glu Thr Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val 
20 25 30 

ctt gag gcg atg gat cgt gta ggg gga aag act ctg agc gta caa tcg 144 
Leu Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser 
35 40 45 

ggt ccc ggc agg acg act atc aac gac ctc ggc gct gcg tgg atc aat 192 
Gly Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn 
50 55 60 

gac agc aac caa agc gaa gta tcc aga ttg ttt gaa aga ttt cat ttg 240 
Asp Ser Asn Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu 
65 70 75 80 

gag ggc gag ctc cag agg acg act gga aat tca atc cat caa gca caa 288 
Glu Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln 
85 90 95 

gac ggt aca acc act aca gct cct tat ggt gac tcc ttg ctg agc gag 336 
Asp Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu 
100 105 110 

gag gtt gca agt gca ctt gcg gaa ctc ctc ccc gta tgg tct cag ctg 384 
Glu Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu 
115 120 125 

atc gaa gag cat agc ctt caa gac ctc aag gcg agc cct cag gcg aag 432 
Ile Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys 
130 135 140 

cgg ctc gac agt gtg agc ttc gcg cac tac tgt gag aag gaa cta aac 480 
Arg Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn 
145 150 155 160 

ttg cct gct gtt ctc ggc gta gca aac cag atc aca cgc gct ctg ctc 528 
Leu Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu 
165 170 175 

ggt gtg gaa gcc cac gag atc agc atg ctt ttt ctc acc gac tac atc 576 
Gly Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile 
180 185 190 

aag agt gcc acc ggt ctc agt aat att ttc tcg gac aag aaa gac ggc 624 
Lys Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly 
195 200 205 

ggg cag tat atg cga tgc aaa aca ggt atg cag tcg att tgc cat gcc 672 
Gly Gln Tyr Met Arg Cys Lys Thr Gly Met Gln Ser Ile Cys His Ala 
210 215 220 

atg tca aag gaa ctt gtt cca ggc tca gtg cac ctc aac acc ccc gtc 720 
Met Ser Lys Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val 
225 230 235 240 

gct gaa att gag cag tcg gca tcc ggc tgt aca gta cga tcg gcc tcg 768 
Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser 
245 250 255 

ggc gcc gtg ttc cga agc aaa aag gtg gtg gtt tcg tta ccg aca acc 816 
Gly Ala Val Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr 
260 265 270 

ttg tat ccc acc ttg aca ttt tca cca cct ctt ccc gcc gag aag caa 864 
Leu Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln 
275 280 285 

gca ttg gcg gaa aat tct atc ctg ggc tac tat agc aag ata gtc ttc 912 
Ala Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe 
290 295 300 

gta tgg gac aag ccg tgg tgg cgc gaa caa ggc ttc tcg ggc gtc ctc 960 
Val Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu 
305 310 315 320 

caa tcg agc tcc gac ccc atc tca ttt gcc aga gat acc agc atc gac 1008 
Gln Ser Ser Ser Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp 
325 330 335 

gtc gat cga caa tgg tcc att acc tgt ttc atg gtc gga gac ccg gga 1056 
Val Asp Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly 
340 345 350 

cgg aag tgg tcc caa cag tcc aag cag gta cga caa aag tct gtc tgg 1104 
Arg Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp 
355 360 365 

gac caa ctc cgc gca gcc tac gag aac gcc ggg gcc caa gtc cca gag 1152 
Asp Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu 
370 375 380 

ccg gcc aac gtg ctc gaa atc gag tgg tcg aag cag cag tat ttc caa 1200 
Pro Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln 
385 390 395 400 

gga gct ccg agc gcc gtc tat ggg ctg aac gat ctc atc aca ctg ggt 1248 
Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly 
405 410 415 

tcg gcg ctc aga acg ccg ttc aag agt gtt cat ttc gtt gga acg gag 1296 
Ser Ala Leu Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu 
420 425 430 

acg tct tta gtt tgg aaa ggg tat atg gaa ggg gcc ata cga tcg ggt 1344 
Thr Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly 
435 440 445 

caa cga ggt gct gca gaa gtt gtg gct agc ctg gtg cca gca gca tag 1392 
Gln Arg Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala 
450 455 460 



<210> 49 
<211> 463 
<212> PRT 
<213> Unknown 



<220> 

<223> Cys (-) APAO; removal of cysteine 461 
<400> 49 

Lys Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly 
1 5 10 15 

Leu Glu Thr Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val 
20 25 30 

Leu Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser 
35 40 45 

Gly Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn 
50 55 60 

Asp Ser Asn Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu 
65 70 75 80 

Glu Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln 
85 90 95 

Asp Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu 
100 105 110 

Glu Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu 
115 120 125 

Ile Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys 
130 135 140 

Arg Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn 
145 150 155 160 

Leu Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu 
165 170 175 

Gly Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile 
180 185 190 

Lys Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly 
195 200 205 

Gly Gln Tyr Met Arg Cys Lys Thr Gly Met Gln Ser Ile Cys His Ala 
210 215 220 

Met Ser Lys Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val 
225 230 235 240 

Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser 
245 250 255 

Gly Ala Val Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr 
260 265 270 

Leu Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln 
275 280 285 

Ala Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe 
290 295 300 

Val Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu 
305 310 315 320 

Gln Ser Ser Ser Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp 
325 330 335 

Val Asp Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly 
340 345 350 

Arg Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp 
355 360 365 

Asp Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu 
370 375 380 

Pro Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln 
385 390 395 400 

Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly 
405 410 415 

Ser Ala Leu Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu 
420 425 430 

Thr Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly 
435 440 445 

Gln Arg Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala 
450 455 460 



<210> 50 
<211> 1392 
<212> DNA 
<213> Unknown 



<220> 
<221> CDS 
<222> (1)...(1392) 
<223> Cys (-) APAO; removal of cysteines 359 and 461 



<400> 50 

aaa gac aac gtt gcg gac gtg gta gtg gtg ggc gct ggc ttg agc ggt 48 
Lys Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly 
1 5 10 15 

ttg gag acg gca cgc aaa gtc cag gcc gcc ggt ctg tcc tgc ctc gtt 96 
Leu Glu Thr Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val 
20 25 30 

ctt gag gcg atg gat cgt gta ggg gga aag act ctg agc gta caa tcg 144 
Leu Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser 
35 40 45 

ggt ccc ggc agg acg act atc aac gac ctc ggc gct gcg tgg atc aat 192 
Gly Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn 
50 55 60 

gac agc aac caa agc gaa gta tcc aga ttg ttt gaa aga ttt cat ttg 240 
Asp Ser Asn Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu 
65 70 75 80 

gag ggc gag ctc cag agg acg act gga aat tca atc cat caa gca caa 288 
Glu Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln 
85 90 95 

gac ggt aca acc act aca gct cct tat ggt gac tcc ttg ctg agc gag 336 
Asp Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu 
100 105 110 

gag gtt gca agt gca ctt gcg gaa ctc ctc ccc gta tgg tct cag ctg 384 
Glu Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu 
115 120 125 

atc gaa gag cat agc ctt caa gac ctc aag gcg agc cct cag gcg aag 432 
Ile Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys 
130 135 140 

cgg ctc gac agt gtg agc ttc gcg cac tac tgt gag aag gaa cta aac 480 
Arg Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn 
145 150 155 160 

ttg cct gct gtt ctc ggc gta gca aac cag atc aca cgc gct ctg ctc 528 
Leu Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu 
165 170 175 

ggt gtg gaa gcc cac gag atc agc atg ctt ttt ctc acc gac tac atc 576 
Gly Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile 
180 185 190 

aag agt gcc acc ggt ctc agt aat att ttc tcg gac aag aaa gac ggc 624 
Lys Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly 
195 200 205 

ggg cag tat atg cga tgc aaa aca ggt atg cag tcg att tcg cat gcc 672 
Gly Gln Tyr Met Arg Cys Lys Thr Gly Met Gln Ser Ile Ser His Ala 
210 215 220 

atg tca aag gaa ctt gtt cca ggc tca gtg cac ctc aac acc ccc gtc 720 
Met Ser Lys Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val 
225 230 235 240 

gct gaa att gag cag tcg gca tcc ggc tgt aca gta cga tcg gcc tcg 768 
Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser 
245 250 255 

ggc gcc gtg ttc cga agc aaa aag gtg gtg gtt tcg tta ccg aca acc 816 
Gly Ala Val Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr 
260 265 270 

ttg tat ccc acc ttg aca ttt tca cca cct ctt ccc gcc gag aag caa 864 
Leu Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln 
275 280 285 

gca ttg gcg gaa aat tct atc ctg ggc tac tat agc aag ata gtc ttc 912 
Ala Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe 
290 295 300 

gta tgg gac aag ccg tgg tgg cgc gaa caa ggc ttc tcg ggc gtc ctc 960 
Val Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu 
305 310 315 320 

caa tcg agc tcc gac ccc atc tca ttt gcc aga gat acc agc atc gac 1008 
Gln Ser Ser Ser Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp 
325 330 335 

gtc gat cga caa tgg tcc att acc tgt ttc atg gtc gga gac ccg gga 1056 
Val Asp Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly 
340 345 350 

cgg aag tgg tcc caa cag tcc aag cag gta cga caa aag tct gtc tgg 1104 
Arg Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp 
355 360 365 

gac caa ctc cgc gca gcc tac gag aac gcc ggg gcc caa gtc cca gag 1152 
Asp Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu 
370 375 380 

ccg gcc aac gtg ctc gaa atc gag tgg tcg aag cag cag tat ttc caa 1200 
Pro Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln 
385 390 395 400 

gga gct ccg agc gcc gtc tat ggg ctg aac gat ctc atc aca ctg ggt 1248 
Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly 
405 410 415 

tcg gcg ctc aga acg ccg ttc aag agt gtt cat ttc gtt gga acg gag 1296 
Ser Ala Leu Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu 
420 425 430 

acg tct tta gtt tgg aaa ggg tat atg gaa ggg gcc ata cga tcg ggt 1344 
Thr Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly 
435 440 445 

caa cga ggt gct gca gaa gtt gtg gct agc ctg gtg cca gca gca tag 1392 
Gln Arg Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala * 
450 455 460 



<210> 51 

<211> 463 

<212> PRT 

<213> Unknown 

<220> 

<223> Cys (-) APAO; removal of cysteines 359 and 461 
<400> 51 

Lys Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly 
1 5 10 15 

Leu Glu Thr Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Cys Leu Val 
20 25 30 

Leu Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser 
35 40 45 

Gly Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn 
50 55 60 

Asp Ser Asn Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu 
65 70 75 80 

Glu Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln 
85 90 95 

Asp Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu 
100 105 110 

Glu Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu 
115 120 125 

Ile Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys 
130 135 140 

Arg Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn 
145 150 155 160 

Leu Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu 
165 170 175 

Gly Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile 
180 185 190 

Lys Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly 
195 200 205 

Gly Gln Tyr Met Arg Cys Lys Thr Gly Met Gln Ser Ile Ser His Ala 
210 215 220 

Met Ser Lys Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val 
225 230 235 240 

Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser 
245 250 255 

Gly Ala Val Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr 
260 265 270 

Leu Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln 
275 280 285 

Ala Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe 
290 295 300 

Val Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu 
305 310 315 320 

Gln Ser Ser Ser Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp 
325 330 335 

Val Asp Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly 
340 345 350 

Arg Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp 
355 360 365 

Asp Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu 
370 375 380 

Pro Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln 
385 390 395 400 

Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly 
405 410 415 

Ser Ala Leu Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu 
420 425 430 

Thr Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly 
435 440 445 

Gln Arg Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala 
450 455 460 



<210> 52 
<211> 1392 
<212> DNA 
<213> Unknown 



<220> 
<221> CDS 
<222> (1)...(1392) 
<223> Cys (-) APAO; removal of cysteines 169, 359, and 461 



<400> 52 

aaa gac aac gtt gcg gac gtg gta gtg gtg ggc gct ggc ttg agc ggt 48 
Lys Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly 
1 5 10 15 

ttg gag acg gca cgc aaa gtc cag gcc gcc ggt ctg agc tcc ctc gtt 96 
Leu Glu Thr Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Ser Leu Val 
20 25 30 

ctt gag gcg atg gat cgt gta ggg gga aag act ctg agc gta caa tcg 144 
Leu Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser 
35 40 45 

ggt ccc ggc agg acg act atc aac gac ctc ggc gct gcg tgg atc aat 192 
Gly Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn 
50 55 60 

gac agc aac caa agc gaa gta tcc aga ttg ttt gaa aga ttt cat ttg 240 
Asp Ser Asn Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu 
65 70 75 80 

gag ggc gag ctc cag agg acg act gga aat tca atc cat caa gca caa 288 
Glu Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln 
85 90 95 

gac ggt aca acc act aca gct cct tat ggt gac tcc ttg ctg agc gag 336 
Asp Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu 
100 105 110 

gag gtt gca agt gca ctt gcg gaa ctc ctc ccc gta tgg tct cag ctg 384 
Glu Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu 
115 120 125 

atc gaa gag cat agc ctt caa gac ctc aag gcg agc cct cag gcg aag 432 
Ile Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys 
130 135 140 

cgg ctc gac agt gtg agc ttc gcg cac tac tgt gag aag gaa cta aac 480 
Arg Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn 
145 150 155 160 

ttg cct gct gtt ctc ggc gta gca aac cag atc aca cgc gct ctg ctc 528 
Leu Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu 
165 170 175 

ggt gtg gaa gcc cac gag atc agc atg ctt ttt ctc acc gac tac atc 576 
Gly Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile 
180 185 190 

aag agt gcc acc ggt ctc agt aat att ttc tcg gac aag aaa gac ggc 624 
Lys Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly 
195 200 205 

ggg cag tat atg cga tgc aaa aca ggt atg cag tcg att tcg cat gcc 672 
Gly Gln Tyr Met Arg Cys Lys Thr Gly Met Gln Ser Ile Ser His Ala 
210 215 220 

atg tca aag gaa ctt gtt cca ggc tca gtg cac ctc aac acc ccc gtc 720 
Met Ser Lys Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val 
225 230 235 240 

gct gaa att gag cag tcg gca tcc ggc tgt aca gta cga tcg gcc tcg 768 
Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser 
245 250 255 

ggc gcc gtg ttc cga agc aaa aag gtg gtg gtt tcg tta ccg aca acc 816 
Gly Ala Val Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr 
260 265 270 

ttg tat ccc acc ttg aca ttt tca cca cct ctt ccc gcc gag aag caa 864 
Leu Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln 
275 280 285 

gca ttg gcg gaa aat tct atc ctg ggc tac tat agc aag ata gtc ttc 912 
Ala Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe 
290 295 300 

gta tgg gac aag ccg tgg tgg cgc gaa caa ggc ttc tcg ggc gtc ctc 960 
Val Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu 
305 310 315 320 

caa tcg agc tcc gac ccc atc tca ttt gcc aga gat acc agc atc gac 1008 
Gln Ser Ser Ser Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp 
325 330 335 

gtc gat cga caa tgg tcc att acc tgt ttc atg gtc gga gac ccg gga 1056 
Val Asp Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly 
340 345 350 

cgg aag tgg tcc caa cag tcc aag cag gta cga caa aag tct gtc tgg 1104 
Arg Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp 
355 360 365 

gac caa ctc cgc gca gcc tac gag aac gcc ggg gcc caa gtc cca gag 1152 
Asp Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu 
370 375 380 

ccg gcc aac gtg ctc gaa atc gag tgg tcg aag cag cag tat ttc caa 1200 
Pro Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln 
385 390 395 400 

gga gct ccg agc gcc gtc tat ggg ctg aac gat ctc atc aca ctg ggt 1248 
Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly 
405 410 415 

tcg gcg ctc aga acg ccg ttc aag agt gtt cat ttc gtt gga acg gag 1296 
Ser Ala Leu Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu 
420 425 430 

acg tct tta gtt tgg aaa ggg tat atg gaa ggg gcc ata cga tcg ggt 1344 
Thr Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly 
435 440 445 

caa cga ggt gct gca gaa gtt gtg gct agc ctg gtg cca gca gca tag 1392 
Gln Arg Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala * 
450 455 460 



<210> 53 
<211> 463 
<212> PRT 
<213> Unknown 

<220> 

<223> Cys (-) APAO; removal of cysteines 169, 359, and 461 

<400> 53 

Lys Asp Asn Val Ala Asp Val Val Val Val Gly Ala Gly Leu Ser Gly 
1 5 10 15 

Leu Glu Thr Ala Arg Lys Val Gln Ala Ala Gly Leu Ser Ser Leu Val 
20 25 30 

Leu Glu Ala Met Asp Arg Val Gly Gly Lys Thr Leu Ser Val Gln Ser 
35 40 45 

Gly Pro Gly Arg Thr Thr Ile Asn Asp Leu Gly Ala Ala Trp Ile Asn 
50 55 60 

Asp Ser Asn Gln Ser Glu Val Ser Arg Leu Phe Glu Arg Phe His Leu 
65 70 75 80 

Glu Gly Glu Leu Gln Arg Thr Thr Gly Asn Ser Ile His Gln Ala Gln 
85 90 95 

Asp Gly Thr Thr Thr Thr Ala Pro Tyr Gly Asp Ser Leu Leu Ser Glu 
100 105 110 

Glu Val Ala Ser Ala Leu Ala Glu Leu Leu Pro Val Trp Ser Gln Leu 
115 120 125 

Ile Glu Glu His Ser Leu Gln Asp Leu Lys Ala Ser Pro Gln Ala Lys 
130 135 140 

Arg Leu Asp Ser Val Ser Phe Ala His Tyr Cys Glu Lys Glu Leu Asn 
145 150 155 160 

Leu Pro Ala Val Leu Gly Val Ala Asn Gln Ile Thr Arg Ala Leu Leu 
165 170 175 

Gly Val Glu Ala His Glu Ile Ser Met Leu Phe Leu Thr Asp Tyr Ile 
180 185 190 

Lys Ser Ala Thr Gly Leu Ser Asn Ile Phe Ser Asp Lys Lys Asp Gly 
195 200 205 

Gly Gln Tyr Met Arg Cys Lys Thr Gly Met Gln Ser Ile Ser His Ala 
210 215 220 

Met Ser Lys Glu Leu Val Pro Gly Ser Val His Leu Asn Thr Pro Val 
225 230 235 240 

Ala Glu Ile Glu Gln Ser Ala Ser Gly Cys Thr Val Arg Ser Ala Ser 
245 250 255 

Gly Ala Val Phe Arg Ser Lys Lys Val Val Val Ser Leu Pro Thr Thr 
260 265 270 

Leu Tyr Pro Thr Leu Thr Phe Ser Pro Pro Leu Pro Ala Glu Lys Gln 
275 280 285 

Ala Leu Ala Glu Asn Ser Ile Leu Gly Tyr Tyr Ser Lys Ile Val Phe 
290 295 300 

Val Trp Asp Lys Pro Trp Trp Arg Glu Gln Gly Phe Ser Gly Val Leu 
305 310 315 320 

Gln Ser Ser Ser Asp Pro Ile Ser Phe Ala Arg Asp Thr Ser Ile Asp 
325 330 335 

Val Asp Arg Gln Trp Ser Ile Thr Cys Phe Met Val Gly Asp Pro Gly 
340 345 350 

Arg Lys Trp Ser Gln Gln Ser Lys Gln Val Arg Gln Lys Ser Val Trp 
355 360 365 

Asp Gln Leu Arg Ala Ala Tyr Glu Asn Ala Gly Ala Gln Val Pro Glu 
370 375 380 

Pro Ala Asn Val Leu Glu Ile Glu Trp Ser Lys Gln Gln Tyr Phe Gln 
385 390 395 400 

Gly Ala Pro Ser Ala Val Tyr Gly Leu Asn Asp Leu Ile Thr Leu Gly 
405 410 415 

Ser Ala Leu Arg Thr Pro Phe Lys Ser Val His Phe Val Gly Thr Glu 
420 425 430 

Thr Ser Leu Val Trp Lys Gly Tyr Met Glu Gly Ala Ile Arg Ser Gly 
435 440 445 

Gln Arg Gly Ala Ala Glu Val Val Ala Ser Leu Val Pro Ala Ala 
450 455 460 






